#

nlp-resources

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Here are 43 public repositories matching this topic...

juand-r / entity-recognition-datasets

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.

nlp natural-language-processing annotations named-entity-recognition corpora datasets ner nlp-resources entity-extraction entity-recognition

Updated Nov 29, 2024
Python

neuralmind-ai / portuguese-bert

Portuguese pre-trained BERT models

natural-language-processing deep-learning portuguese nlp-resources bert bert-model

Updated Jun 17, 2024
Python

HKUSTDial / NL2SQL_Handbook

This is a continuously updated handbook for readers to easily track the latest NL2SQL (Text-to-SQL) techniques in the literature and provide practical guidance for researchers and practitioners. If we missed any interesting work, feel free to contact us.

Updated Apr 21, 2025
Python

microsoft / vert-papers

This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microsoft Research Asia (MSRA).

nlp entity-resolution ml named-entity-recognition ner nlp-resources entity-linking unitrans entity-extraction grn entity-disambiguation language-understanding linkingpark bertel can-ner xl-ner cross-lingual-ner

Updated Mar 16, 2024
Python

WorksApplications / SudachiDict

A lexicon for Sudachi

segmentation nlp-resources pos-tagging morphological-analysis

Updated Jan 29, 2025
Python

INK-USC / TriggerNER

TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)

information-extraction dataset named-entity-recognition nlp-resources nlp-datasets low-resource sequence-tagging

Updated Jun 15, 2022
Python

NLP-Guide

mikeroyal / NLP-Guide

Natural Language Processing (NLP). Covering topics such as Tokenization, Part Of Speech tagging (POS), Machine translation, Named Entity Recognition (NER), Classification, and Sentiment analysis.

nlp natural-language-processing awesome machine-translation natural-language speech-synthesis speech-recognition awesome-list nlp-parsing nlp-resources nlp-library semantic-search speech-processing nlp-machine-learning nlp-keywords-extraction speech-enhancement langauge-model gpt-3 natural-language-procressing

Updated Jan 4, 2024
Python

laymonage / kbbi-python

A Python module that fetches a page of a word/phrase from the Online Indonesian Dictionary (https://kbbi.kemdikbud.go.id).

nlp-resources indonesian-language

Updated Sep 12, 2023
Python

StatguyUser / TextFeatureSelection

Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models

nlp machine-learning natural-language-processing text-classification natural-language feature-selection machinelearning natural-language-generation nlp-resources nlp-library natural-language-inference nlp-machine-learning natural-language-understanding text-categorization nlproc naturallanguageprocessing

Updated Jan 4, 2024
Python

nguynking / CS224N

Assignment solutions for CS224N: Natural Language Processing with Deep Learning - Stanford / Winter 2023

machine-learning natural-language-processing deep-learning stanford nlp-resources cs224n cs224n-assignment-solutions

Updated Aug 30, 2023
Python

erickrf / ppdb

Interface for reading the Paraphrase Database (PPDB)

nlp natural-language-processing nlp-resources nlp-library

Updated Mar 14, 2018
Python

eric-haibin-lin / nlp-notebooks

A collection of natural language processing notebooks.

nlp natural-language-processing deep-learning natural-language-generation nlp-resources deep-learning-tutorial natural-language-inference natural-language-understanding

Updated Jul 10, 2019
Python

samhavens / roundtrip

Roundtrip translation (aka back translation) python package

python nlp translation nlp-resources backtranslation round-trip

Updated Dec 8, 2022
Python

yuanjie-ai / iNLP

https://pypi.org/project/iNLP/

nlp nlp-apis nlp-parsing nlp-resources nlp-library nlp-machine-learning nlp-keywords-extraction

Updated Dec 4, 2018
Python

anlausch / DEBIE

Debiasing word embeddings

word-embeddings nlp-resources nlp-machine-learning ethical-artificial-intelligence

Updated Apr 17, 2020
Python

mnschmit / SherLIiC

A Typed Event-Focused Lexical Inference Benchmark for Evaluating Natural Language Inference

nlp challenge acl nlp-resources lexical-semantics nli nlp-datasets acl2019 lexical-inference

Updated Apr 17, 2020
Python

sarves / thamizhi-pos

ThamizhiPOSt - A neural based POS tagger for Tamil

nlp-resources pos-tagger stanza tamil-language tamil-nlp thamizhimorph

Updated Jan 11, 2021
Python

Esukhia / ud-pos-tagger-bo

Basic Universal Dependencies Part-of-Speech Tagger for Tibetan

nlp pos-tag nlp-resources nlp-library tibetan universal-dependencies pos-tagging pos-tagger tibetan-nlp tibetan-pos

Updated Mar 22, 2019
Python

ChmHsm / latinAr

An extensive dataset for latin-written arabic.

python nlp language data dataset arabic nlp-resources arabic-nlp arabic-language

Updated Aug 23, 2018
Python

lukyjanek / universal-derivations

The scripts for compiling the Universal Derivations collections of harmonised word-formation resources for multiple langugaes.

morphology nlp-resources language-resources word-formation universal-derivations uder-collection

Updated Nov 16, 2021
Python

Created by Alan Turing

Followers: 25.6k followers
Wikipedia: Wikipedia