General Concepts
General Concepts of Sequence Models
π©βπ« Notation
In the context of text processing (e.g: Natural Language Processing NLP)
Symbol
Description
$$X^{}$$
The t
th word in the input sequence
$$Y^{}$$
The t
th word in the output sequence
$$X^{(i)}$$
The t
th word in the i
th input sequence
$$Y^{(i)}$$
The t
th word in the i
th output sequence
$$T^{(i)}_x$$
The length of the i
th input sequence
$$T^{(i)}_y$$
The length of the i
th output sequence
π One Hot Encoding
A way to represent words so we can treat with them easily
π Example
Let's say that we have a dictionary that consists of 10 words (π€) and the words of the dictionary are:
Car, Pen, Girl, Berry, Apple, Likes, The, And, Boy, Book.
Our $$X^{(i)}$$ is: The Girl Likes Apple And Berry
So we can represent this sequence like the following π
By representing sequences in this way we can feed out data to neural networks β¨
π Disadvantage
If our dictionary consists of 10,000 words so each vector will be 10,000 dimensional π€
This representation can not capture semantic features π
Last updated