- 2024-02: 🤗 We released the datasets introduced by GIMMICK on HuggingFace
- 2024-02: 📝 We released the first preprint of the GIMMICK Benchmark paper.
Tested only on Debian-based machines
Note that most models are loaded in FP16 or BF16 and Flash Attention 2. Make sure your GPU supports these features_
- Install mamba
mamba env create -f environment.yml
mamba activate gimmick && pip install flash-attn --no-build-isolation
src/scripts/check_env.sh
If the last step was successful, it should print something like this:
##################### PYTHON ENV 'gimmick' INFO START #####################
Python version: 3.12.8 | packaged by conda-forge | (main, Dec 5 2024, 14:24:40) [GCC 13.3.0]
PyTorch version: 2.5.1
CUDA available: True
CUDA version: 11.8
CUDA devices: 1
Flash Attention 2 Support: True
Transformers version: 4.48.1
##################### PYTHON ENV 'gimmick' INFO END #####################
Run PYTHONPATH=src python src/gimmick/main.py models
to print the list of supported models.
- Qwen2 VL 7B 🤗/qwen/qwen2-vl-7b-instruct
- Qwen2 VL 72B 🤗/qwen/qwen2-vl-72b-instruct
- Phi 3.5 Vision 🤗/microsoft/phi-3.5-vision-instruct
- MiniCPM V 2.6 🤗/openbmb/minicpm-v-2_6
- Llama 3.2 11B Vision 🤗/meta-llama/llama-3.2-11b-vision-instruct
- InternVL2.5 1B 🤗/opengvlab/internvl2_5-1b
- InternVL2.5 2B 🤗/opengvlab/internvl2_5-2b
- InternVL2.5 4B 🤗/opengvlab/internvl2_5-4b
- InternVL2.5 8B 🤗/opengvlab/internvl2_5-8b
- InternVL2.5 26B 🤗/opengvlab/internvl2_5-26b
- InternVL2.5 38B 🤗/opengvlab/internvl2_5-38b
- InternVL2.5 78B 🤗/opengvlab/internvl2_5-78b
- Centurio Aya 🤗/wuenlp/centurio_aya
- Centurio Qwen 🤗/wuenlp/centurio_qwen
- Qwen2.5 0.5B 🤗/qwen/qwen2.5-0.5b-instruct
- Qwen2.5 1.5B 🤗/qwen/qwen2.5-1.5b-instruct
- Qwen2.5 3B 🤗/qwen/qwen2.5-3b-instruct
- Qwen2.5 7B 🤗/qwen/qwen2.5-7b-instruct
- Qwen2.5 32B 🤗/qwen/qwen2.5-32b-instruct
- Qwen2.5 72B 🤗/qwen/qwen2.5-72b-instruct
- InternLM2.5 1.8B 🤗/internlm/internlm2_5-1_8b-chat
- InternLM2.5 7B 🤗/internlm/internlm2_5-7b-chat
- InternLM2.5 20B 🤗/internlm/internlm2_5-20b-chat
- Aya Expanse 8B 🤗/cohereforai/aya-expanse-8b
- Phi 3.5 Mini 🤗/microsoft/phi-3.5-mini-instruct
- GPT-4o Mini gpt-4o-mini-2024-07-18
- GPT-4o gpt-4o-2024-11-20
- Gemini Flash gemini-1.5-flash-002
- Gemini Pro gemini-1.5-pro-002
- Claude 3.5 Sonnet claude-3-5-sonnet-20241022