Adding Masked Language Modelling #1030
@pyeres here are the instructions for data generation (point 2 that you raised).
I think my request got buried a ways back in the thread, so I'm resurfacing it here:
We need to make this Task reproducible, and I don't know of a way to do that without making the task's data dependencies reproducible. This can be done with a script or with instructions (if the instructions are involved, they should be in a script).
A script doesn't need to copy the functionality of other open source scripts involved in generating the data; it can simply document that those scripts are used at a given step. The goal is that, using our script/instructions, the user should be able to exactly reproduce the task's data dependencies. (@phu-pmh for visibility).
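To make the request concrete, here is a minimal sketch of what such a script could look like. It is not the script from this PR: the step descriptions, `EXTERNAL_STEPS`, `sha256_of`, and `verify_outputs` are all hypothetical names chosen for illustration. The idea is to record the external steps in one place and then check the generated files against known SHA-256 digests, so a user can confirm they reproduced the data exactly.

```python
import hashlib
from pathlib import Path

# Hypothetical placeholders: the real steps, script names, and versions
# would be filled in by the task author.
EXTERNAL_STEPS = [
    "1. Download the raw corpus (source and version pinned here).",
    "2. Run the third-party extraction script (name and commit pinned here).",
    "3. Split into train/val/test with a fixed random seed.",
]


def sha256_of(path):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_outputs(data_dir, expected):
    """Return the names of files that are missing or whose digest differs.

    `expected` maps a file name (relative to data_dir) to its SHA-256 digest.
    An empty result means the data dependencies were reproduced exactly.
    """
    bad = []
    for name, want in expected.items():
        path = Path(data_dir) / name
        if not path.is_file() or sha256_of(path) != want:
            bad.append(name)
    return bad
```

A user would run the documented steps, then call `verify_outputs` with the directory they generated and the digests published alongside the task; any file name in the returned list points at a step that did not reproduce.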