
How to cache my mixture #69

Open
renmengjie7 opened this issue May 2, 2023 · 2 comments

Comments


renmengjie7 commented May 2, 2023

I noticed this comment in the code:

# If you're using Seqio, we suggest caching your mixture as they take a while to generate.

But I don't know how to do this caching.

@shayne-longpre (Collaborator)

Here are some resources, and there is more detail in the seqio documentation: https://github.com/google/seqio#optional-offline-caching.

This caching applies if you are using the same vocabulary as T5. If you want to train a different model, stream the output and save it in pretokenized text format, as we do in run_example.py.
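A minimal sketch of the streaming approach described above, assuming seqio is installed and a mixture has already been registered under the illustrative name "my_flan_mixture" (this is not the exact code in run_example.py; the file name and sequence lengths are assumptions):

```python
# Sketch only: assumes a mixture named "my_flan_mixture" is registered.
import seqio

ds = seqio.get_mixture_or_task("my_flan_mixture").get_dataset(
    sequence_length={"inputs": 2048, "targets": 512},
    split="train",
    shuffle=False,
)

# seqio's tokenize preprocessor keeps *_pretokenized copies of the text
# by default, so the plain text can be streamed out and saved for use
# with a different model's tokenizer.
with open("mixture.tsv", "w") as f:
    for ex in ds.as_numpy_iterator():
        inputs = ex["inputs_pretokenized"].decode("utf-8")
        targets = ex["targets_pretokenized"].decode("utf-8")
        f.write(f"{inputs}\t{targets}\n")
```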


sunyi06200 commented Jul 13, 2023

So where can I add a seqio.CacheDatasetPlaceholder(required=False)? Or are there more detailed steps for how to use the five original submixes you provided, once I've downloaded them?
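For reference, in seqio the placeholder goes in a Task's preprocessors list: everything before it is what gets cached, and tokenization typically comes after. A hedged sketch of a registration, where the task name, data source, and vocabulary path are illustrative rather than FLAN's actual configuration:

```python
# Sketch only: task name, source, and vocabulary are illustrative.
import seqio

vocab = seqio.SentencePieceVocabulary(
    "gs://t5-data/vocabs/cc_all.32000/sentencepiece.model")
output_features = {
    "inputs": seqio.Feature(vocabulary=vocab),
    "targets": seqio.Feature(vocabulary=vocab),
}

seqio.TaskRegistry.add(
    "my_cached_task",  # hypothetical task name
    source=seqio.TfdsDataSource(tfds_name="wmt_t2t_translate/de-en:1.0.0"),
    preprocessors=[
        # Deterministic text preprocessors go before the placeholder, so
        # their output is what gets written to the cache.
        seqio.CacheDatasetPlaceholder(required=False),
        seqio.preprocessors.tokenize,
        seqio.preprocessors.append_eos_after_trim,
    ],
    output_features=output_features,
)
```

The cache itself is then generated offline with seqio's cache_tasks Apache Beam script, after which `get_dataset(..., use_cached=True)` reads from it; see the seqio README link in the comment above for details.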
