HiGitClass: Keyword-Driven Hierarchical Classification of GitHub Repositories (ICDM'19)
-
Updated
Apr 2, 2024 - Python
HiGitClass: Keyword-Driven Hierarchical Classification of GitHub Repositories (ICDM'19)
Python script that scrapes GitHub repositories to keep track of total clone counts. This is useful for projects that are NSF funded, where "impact" (total downloads) is required to be reported.
Github repo to Neo4j (and back)
SAP Security Research sample code and data to reproduce the research done in our paper "Detecting Backdoors in Collaboration Graphs of Software Repositories".
Dive deep into the history of any Git repository. This Python framework provides comprehensive tools for analyzing source code changes, commit metadata, and developer contributions at a granular level.
⛏️ Repository containing scripts and other resources for finding out how many projects use TDD in practice, based on Git history.
Repository mining for searching Performance Antipatterns in Cyber-Physical System Projects.
Code for searching for institutional dataset records from popular data repositories (e.g., Zenodo)
Mining Linux Kernel Repositories to extract knowledge about AF_XDP
The TRACE tool, developed as part of my Bachelor's thesis in Computer Science at the University of Salerno, is designed to analyze GitHub repositories, with the main purpose of calculating the Cultural Dispersion in a community of developers.
A pipeline processor to extract data from Gitlab and transform into a graph representation
Identifies and clones repositories in which a given bot or user has created at least one issue.
JMT - A multi-language tool to collect code elements impacted by commits.
Add a description, image, and links to the repository-mining topic page so that developers can more easily learn about it.
To associate your repository with the repository-mining topic, visit your repo's landing page and select "manage topics."