While Rotary Position Embeddings (RoPE) have become widely adopted for large language models, their application to other modalities has been limited. This repository introduces Lie group Relative position Encodings (LieRE), which generalizes rotary encodings beyond 1D sequences to n-dimensional inputs.
LieRE demonstrates significant improvements over the baselines (DeiT III, RoPE-Mixed, and Vision-Llama):
- 🎯 Accuracy: up to 9.7% higher for 2D tasks and 25.5% higher for 3D tasks than DeiT III (absolute position encoding)
- ⚡ Training Efficiency: 3.5x reduction in training time compared to DeiT III
- 📊 Data Efficiency: 30% improvement in data efficiency compared to DeiT III
LieRE encodes each position as a rotation matrix, computed as the matrix exponential of a skew-symmetric matrix. The skew-symmetric generators are learnable parameters, so the rotations themselves are learned rather than fixed.
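Concretely, matching the snippet below, the encoding of a position $x \in \mathbb{R}^d$ is

$$R(x) = \exp\Big(\sum_{i=1}^{d} x_i A_i\Big), \qquad A_i = U_i - U_i^\top,$$

where each $U_i$ is the strictly upper-triangular part of a learned parameter matrix. Every $A_i$ is skew-symmetric by construction, so $R(x)$ is orthogonal, i.e. a rotation.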
# Core Implementation
```python
import math
import torch
from torch import nn

# One raw (head_dim x head_dim) parameter matrix per input dimension.
# (`input_dimensionality`, `head_dim`, and a `positions` tensor of shape
# (..., input_dimensionality) are assumed to be defined in scope.)
generator_raw_params = nn.Parameter(
    torch.rand(
        input_dimensionality,
        head_dim,
        head_dim,
    ) * 2 * math.pi
)

# Antisymmetrize: keep the strictly upper-triangular part U, then form
# A = U - U^T, which is skew-symmetric by construction.
upper_triangle = torch.triu(generator_raw_params, diagonal=1)
skew_bases = upper_triangle - torch.transpose(upper_triangle, -1, -2)

# Weight each basis matrix by the corresponding position coordinate;
# broadcasting yields shape (..., input_dimensionality, head_dim, head_dim).
in_basis_positions = positions.reshape(list(positions.shape) + [1] * 2) * skew_bases

# Sum over input dimensions: one skew-symmetric generator per position.
generator_pos = torch.sum(in_basis_positions, dim=-3)

# The matrix exponential of a skew-symmetric matrix is a rotation matrix.
# Exponentiate in float32 for numerical stability, then cast back.
rotation = torch.matrix_exp(generator_pos.to(dtype=torch.float32)).to(dtype=positions.dtype)
```
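As with RoPE, the resulting rotations are applied to the queries and keys before the attention dot product, so the score between two tokens depends on their positions only through $R(x_i)^\top R(x_j)$. A minimal sketch of this step follows; the helper name and tensor shapes are illustrative assumptions, not the repository's exact code:

```python
import torch

def apply_liere_rotation(q, k, rotation):
    """Rotate per-position query/key vectors (illustrative helper, not from this repo).

    q, k:     (batch, heads, seq, head_dim)
    rotation: (seq, head_dim, head_dim), as computed above
    """
    # Multiply each query/key vector by its position's rotation matrix.
    q_rot = torch.einsum("sij,bhsj->bhsi", rotation, q)
    k_rot = torch.einsum("sij,bhsj->bhsi", rotation, k)
    return q_rot, k_rot

# Attention then proceeds as usual on the rotated tensors:
# scores = (q_rot @ k_rot.transpose(-2, -1)) / math.sqrt(head_dim)
```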
# Requirements

- PyTorch >= 1.7.0 (needed for `torch.matrix_exp`)
- PyTorch Lightning
- torchvision
# Installation

```bash
git clone https://github.com/yourusername/liere-position-encodings.git
cd liere-position-encodings
pip install -r requirements.txt
```
# Training

To reproduce the CIFAR-100 results, use:

```bash
sbatch -c 48 --gres=gpu:l40:4 --nodelist=rae1 --time=00:00:00 lightning_cifar100.sh
```
# Position Encoding Options

You can compare different position encoding methods using the following options:

- `liere`: our proposed method
- `rope_mixed`: RoPE-Mixed baseline
- `absolute`: standard absolute position encoding
- `visionllama`: Vision-Llama encoding
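A hypothetical invocation is sketched below; the entry point and flag name are illustrative assumptions, not this repository's documented interface, so check `lightning_cifar100.sh` for the actual arguments:

```bash
# Illustrative only: the script and flag names are assumptions, not the repo's real CLI.
python train.py --position_encoding liere
```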
This repository builds on the transformer implementation and hyperparameters from vision-transformers-cifar10.
# Citation

If you find this work useful, please cite:

```bibtex
@article{ostmeier2024liere,
  title={LieRE: Generalizing Rotary Position Encodings},
  author={Ostmeier, Sophie and Axelrod, Brian and Moseley, Michael E and Chaudhari, Akshay and Langlotz, Curtis},
  journal={arXiv preprint arXiv:2406.10322},
  year={2024}
}
```
# License

This project is licensed under the MIT License - see the LICENSE file for details.
# Acknowledgments

- Much of the code was branched from vision-transformers-cifar10
- Thanks to all contributors and supporters of this research