β βBasics of Sequence Models
Sequences are data structures where each example could be seen as a series of data points, for example π§:
Since we have labeled data X and Y so all of these tasks are addressed as Supervised Learning π©βπ«
Even in Sequence-to-Sequence tasks lengths of input and output can be different β
Machine learning algorithms typically require the text input to be represented as a fixed-length vector π
Thus, to model sequences, we need a specific learning framework able to:
β Deal with variable-length sequences
β Maintain sequence order
β Keep track of long-term dependencies rather than cutting input data too short
β Share parameters across the sequence (so not re-learn things across the sequence)