-
Notifications
You must be signed in to change notification settings - Fork 13
Open
Description
Thanks for the nice work. We are trying to make your work into an ML problem set for our class.
I have a question about one-hot encoding code here
Line 35 in 87014f4
| if i-motlen+1<len(sequence) and sequence[i-motlen+1]=='N' or i<motlen-1 or i>len(sequence)+motlen-2: |
It seems that the one-hot encoding code set the first few and last few sequences to 0.25. And the length of the sequence that is set to 0.25 is equal to motiflen, I wonder what is the reason for that. I also read the paper, but did not see an explanation for this choice. Is this something standard to do, and where can I read more about this?
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels