#85 Hotfix/RobertaTokenizerFast object has no attribute max_len #86
e89eaa3 to 37e68da
I checked
Looks like everything works as before for |
Hi @kirzharov |
Hi @felixgwu, thanks! Did you mean about only |
When I ran |
@felixgwu Ok, thanks! I'll check today 👍 |
Hi @kirzharov, we have fixed the bug and merged your PR. FYI, the mismatch arises because Hugging Face's fast tokenizers are inconsistent with the old tokenizers and produce different tokens. We have disabled the fast tokenizers for now. Thanks again for your help!
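For readers hitting the same error, here is a minimal sketch of a compatibility helper. It is not the code merged in this PR. It assumes the error arises because newer tokenizer objects expose `model_max_length` instead of `max_len`; the helper name `get_max_length` and the default of 512 are hypothetical, and the stand-in objects below only imitate tokenizers.

```python
from types import SimpleNamespace


def get_max_length(tokenizer, default=512):
    """Return the tokenizer's maximum sequence length.

    Tries the newer attribute name first, then the older one, and
    falls back to `default` (an assumed value) if neither is set.
    """
    for attr in ("model_max_length", "max_len"):
        value = getattr(tokenizer, attr, None)
        if value is not None:
            return value
    return default


# Stand-in objects that mimic old-style and new-style tokenizers.
old_style = SimpleNamespace(max_len=512)
new_style = SimpleNamespace(model_max_length=256)

print(get_max_length(old_style), get_max_length(new_style))
```

Separately, the slow tokenizers mentioned above can usually be requested with `AutoTokenizer.from_pretrained(name, use_fast=False)` in `transformers`; whether that matches the change made in this repository is not stated in the thread.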
@felixgwu, awesome, thanks! It's great that you managed to fix this quickly. 🔥 I also noticed a difference with the new Fast tokenizers and |
Hi, Thanks, |
Hi @kalyankumarp, |
Hi Felix, It's working. Thanks for the fix. |
Link to the issue: #85