An LLM cookbook for building your own from scratch, all the way from gathering data to training a model
This repository features a custom-built decoder-only language model (LLM) with a total of 37 million parameters 🔥. I trained the model to ask questions from a given context.
Experimental project for AI and NLP based on the Transformer architecture
Generate captions for images using a CNN encoder / LSTM decoder structure
Transformers Intuition
Implementation of the GPT-3 paper: Language Models are Few-Shot Learners
An explainable and simplified version of OLMo model
DNA sequence generation and classification using Transformers
An LLM-based tool for generating cheese advertisements
This project aims to simplify texts from research papers using advanced natural language processing (NLP) techniques, making them more accessible to a broader audience
Using LLMs from Hugging Face for sentiment analysis, translation, summarization, and extractive question answering
A multimodal vision model that takes in an image and a prompt query and outputs the answer
A GPT-2-like (124 million parameter) decoder Transformer model; compares prediction accuracy against OpenAI's model.
A miniGPT inspired by Andrej Karpathy's original nanoGPT. This notebook walks through the decoder part of the Transformer architecture with details outlined (a minimal sketch of such a decoder block follows this list).
On the Design and Performance of Machine Learning Based Error Correcting Decoders
A mini version of GPT trained on Shakespeare using BPE tokenization (a BPE merge sketch also follows this list)
Decoder model for language modelling
Build a text summarizer for the Arabic language
Text Generation using RNN, LSTM, and Transformer
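Several entries above center on the same building block: the decoder-only Transformer layer used in GPT-style models. The following is a minimal PyTorch sketch of one such block, in the spirit of the miniGPT/nanoGPT-style repositories listed; all sizes (n_embd, n_head, block_size) are illustrative assumptions, not taken from any particular repo.

```python
# Minimal decoder-only Transformer block: causal self-attention + MLP,
# each wrapped in a pre-norm residual connection.
import math
import torch
import torch.nn as nn
import torch.nn.functional as F

class CausalSelfAttention(nn.Module):
    def __init__(self, n_embd: int, n_head: int, block_size: int):
        super().__init__()
        assert n_embd % n_head == 0
        self.n_head = n_head
        self.qkv = nn.Linear(n_embd, 3 * n_embd)  # fused query/key/value projection
        self.proj = nn.Linear(n_embd, n_embd)     # output projection
        # lower-triangular mask so each position attends only to the past
        mask = torch.tril(torch.ones(block_size, block_size))
        self.register_buffer("mask", mask.view(1, 1, block_size, block_size))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        B, T, C = x.shape
        q, k, v = self.qkv(x).split(C, dim=2)
        # reshape to (B, n_head, T, head_dim) for multi-head attention
        q = q.view(B, T, self.n_head, C // self.n_head).transpose(1, 2)
        k = k.view(B, T, self.n_head, C // self.n_head).transpose(1, 2)
        v = v.view(B, T, self.n_head, C // self.n_head).transpose(1, 2)
        att = (q @ k.transpose(-2, -1)) / math.sqrt(k.size(-1))
        att = att.masked_fill(self.mask[:, :, :T, :T] == 0, float("-inf"))
        att = F.softmax(att, dim=-1)
        y = att @ v                               # (B, n_head, T, head_dim)
        y = y.transpose(1, 2).contiguous().view(B, T, C)
        return self.proj(y)

class DecoderBlock(nn.Module):
    def __init__(self, n_embd: int, n_head: int, block_size: int):
        super().__init__()
        self.ln1 = nn.LayerNorm(n_embd)
        self.attn = CausalSelfAttention(n_embd, n_head, block_size)
        self.ln2 = nn.LayerNorm(n_embd)
        self.mlp = nn.Sequential(
            nn.Linear(n_embd, 4 * n_embd),
            nn.GELU(),
            nn.Linear(4 * n_embd, n_embd),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = x + self.attn(self.ln1(x))  # residual over attention
        x = x + self.mlp(self.ln2(x))   # residual over MLP
        return x

# quick shape check with illustrative sizes
block = DecoderBlock(n_embd=128, n_head=4, block_size=64)
out = block(torch.randn(2, 64, 128))
print(out.shape)  # torch.Size([2, 64, 128])
```

A full GPT-style model stacks several of these blocks between a token/position embedding and a final linear head over the vocabulary.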
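The Shakespeare miniGPT entry above mentions BPE, the byte-pair-encoding tokenization used by GPT-family models. Here is a minimal sketch of the core BPE merge loop; the corpus and number of merges are illustrative assumptions.

```python
# Byte-pair encoding, reduced to its core idea: start from characters and
# repeatedly merge the most frequent adjacent pair into a single token.
from collections import Counter

def get_pair_counts(tokens: list[str]) -> Counter:
    """Count adjacent token pairs in the sequence."""
    return Counter(zip(tokens, tokens[1:]))

def merge_pair(tokens: list[str], pair: tuple[str, str]) -> list[str]:
    """Replace every occurrence of `pair` with the concatenated token."""
    merged, i = [], 0
    while i < len(tokens):
        if i < len(tokens) - 1 and (tokens[i], tokens[i + 1]) == pair:
            merged.append(tokens[i] + tokens[i + 1])
            i += 2
        else:
            merged.append(tokens[i])
            i += 1
    return merged

text = "to be or not to be"
tokens = list(text)
num_merges = 5  # illustrative; real vocabularies use thousands of merges
for _ in range(num_merges):
    counts = get_pair_counts(tokens)
    if not counts:
        break
    best = counts.most_common(1)[0][0]
    tokens = merge_pair(tokens, best)
print(tokens)
```

Each merge grows the vocabulary by one token; recording the merges in order yields the table a BPE tokenizer later replays to encode new text.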