Skip to content

change data to sequences processing#3

Open
TatianaShavrina wants to merge 1 commit intoChunML:masterfrom
TatianaShavrina:patch-1
Open

change data to sequences processing#3
TatianaShavrina wants to merge 1 commit intoChunML:masterfrom
TatianaShavrina:patch-1

Conversation

@TatianaShavrina
Copy link

All the lines having smth like data[i*seq_length:(i+1)*seq_length] actually get 1/seq_length of the training data, as they convert data to disjoint sequences of seq_length length. Proposing a small enhancement - to gather all char sequences from the data, with step 1. That will slower the training, but will correct the monstrous loss of data.

All the lines having smth like data[i*seq_length:(i+1)*seq_length] actually get 1/seq_length of the training data, as they convert data to disjoint sequences of seq_length length. Proposing a small enhancement - to gather all char sequences from the data, with step 1. That will slower the training, but will correct the monstrous loss of data.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant