PyTorch Implementation of "FACTS: A Factored State-Space Framework For World Modelling" (accepted at ICLR 2025)
- Install your dependencies, especially PyTorch: 'pytorch>=2.1.0' 'einops>=0.8.0'. You may find Conda a useful tool for managing your virtual environment!
- In a terminal, cd to FACTS/ and run pip install -e .
- (Optional) Test your installation: under FACTS/, run _tests_/scripts/installation_test.sh. If you see "Good to go!", you are good to go!
For now, we provide three examples to show its usage; more details and demos will be released later (along with a more efficient, Triton-powered implementation). Stay tuned!
- Example 1 (very basic):
import torch
from facts_ssm import FACTS
facts = FACTS(
in_features=32,
in_factors=128, # M
num_factors=8, # K
slot_size=32, # D
).to('cuda')
X = torch.randn(4, 30, 128, 32).to('cuda') # [batch, seq_len, M, D]
y, z = facts(X) # [batch, seq_len, K, D], [batch, seq_len, K, D]
print(f"Output y: {y.size()}")
print(f"Output z: {z.size()}")
- Example 2 (flexible customisation):
import torch
from facts_ssm import FACTS
facts = FACTS(
in_features=32,
in_factors=128, # M
num_factors=128, # K
slot_size=32, # D
num_heads=4, # multi-head FACTS
dropout=0.1, # dropout
C_rank=32, # set to D to use the output projection C, as in SSMs
router='sfmx_attn', # router customisation
init_method='learnable', # choose an RNN memory initialisation method
slim_mode=True, # turn on to save ~25% of the parameters
residual=True # supported only when M == K
).to('cuda')
X = torch.randn(4, 30, 128, 32).to('cuda') # [batch, seq_len, M, D]
y, z = facts(X) # [batch, seq_len, K, D], [batch, seq_len, K, D]
print(f"Output y: {y.size()}")
print(f"Output z: {z.size()}")
- Example 3 (semi-parallel RNNs, i.e. chunking):
import torch
from facts_ssm import FACTS
facts = FACTS(
in_features=32,
in_factors=128, # M
num_factors=128, # K
slot_size=32, # D
num_heads=4,
dropout=0.1,
C_rank=32, # set to D to allow the output projection C
fast_mode=False, # disable the full-length parallel scan
chunk_size=16 # parallel within each chunk, sequential across chunks
).to('cuda')
X = torch.randn(4, 30, 128, 32).to('cuda') # [batch, seq_len, M, D]
y, z = facts(X) # [batch, seq_len, K, D], [batch, seq_len, K, D]
print(f"Output y: {y.size()}")
print(f"Output z: {z.size()}")
- See the Multivariate Time Series Forecasting (MTSF) example.
- For better efficiency, we are working on a Triton implementation of the SSM scan operation; an experimental version is now available.
- coming soon ...
We regularly respond to GitHub issues about running the code. For further inquiries and discussions (e.g. questions about the paper), email: linanbo2008@gmail.com.
If you find this code useful, please cite it in your paper:
@article{nanbo2024facts,
title={FACTS: A Factored State-Space Framework For World Modelling},
author={Nanbo, Li and Laakom, Firas and Xu, Yucheng and Wang, Wenyi and Schmidhuber, J{\"u}rgen},
journal={arXiv preprint arXiv:2410.20922},
year={2024}
}