You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When reproducing results, I want to find data/promoter_design/make_genome_memmap.py to generate .mmap file for human reference when generating promoter sequence, but didn't find it. Thanks!
The text was updated successfully, but these errors were encountered:
Hi, great work. Seconding the original question. The file data/promoter_design/make_genome_memmap.py is not present. Can you please upload it to reproduce the results? Cheers!
I added make_genome_memmap.py file into directory external.py. make_genome_memmap.py requires selene_utils.py file. The usage of this file is the following:
python make_genome_memmap.py <path_to_fasta>
The script creates file with one-hot encoding of a reference genome provided in fasta file.
To speed up genome sequence retrieval, the training scripts use a memory-mapped genome file. Run python misc/make_genome_memmap.py before you use the training scripts (47Gb space needed).
Solution 2
Alternative solution is not to use a memory-mapped genome. For doing so, one can replace code of genome dataloader from:
Hi!
This is an interesting work!
When reproducing results, I want to find data/promoter_design/make_genome_memmap.py to generate .mmap file for human reference when generating promoter sequence, but didn't find it. Thanks!
The text was updated successfully, but these errors were encountered: