Alright, in the previous post we have learned to tokenize and sequence the tokens from a sentence. We can observe that the length of tokens differ.
We need to make sure that inputs are of the same length. Padding saves us from this problem!
data:image/s3,"s3://crabby-images/7abda/7abda0baf7fcdf6a64740f06f2ec5cacfc427917" alt=""
‘pad_sequences’ padded the sequences into the same length. You can observe that 0’s are padded in the beginning of a list which is smaller in size.
‘pad_sequence’ can be used to pad a sentence in the end or in the beginning, or padding a sentence to a desired length by truncating the sequence…
data:image/s3,"s3://crabby-images/6f77c/6f77c2c0f850f7554cdcebae6f35c3217f95ca71" alt=""
Okay, that’s enough! In the next post we will handle a real dataset by applying the techniques we have learned!
댓글