πGeneral Concepts
General Concepts of Sequence Models
π©βπ« Notation
In the context of text processing (e.g: Natural Language Processing NLP)
Symbol | Description |
$$X^{}$$ | The |
$$Y^{}$$ | The |
$$X^{(i)}$$ | The |
$$Y^{(i)}$$ | The |
$$T^{(i)}_x$$ | The length of the |
$$T^{(i)}_y$$ | The length of the |
π One Hot Encoding
A way to represent words so we can treat with them easily
π Example
Let's say that we have a dictionary that consists of 10 words (π€) and the words of the dictionary are:
Car, Pen, Girl, Berry, Apple, Likes, The, And, Boy, Book.
Our $$X^{(i)}$$ is: The Girl Likes Apple And Berry
So we can represent this sequence like the following π
By representing sequences in this way we can feed out data to neural networks β¨
π Disadvantage
If our dictionary consists of 10,000 words so each vector will be 10,000 dimensional π€
This representation can not capture semantic features π
Last updated