This curated list contains 880 awesome open-source projects with a total of 3M stars grouped into 33 categories. All projects are ranked by a project-quality score, which is calculated from metrics automatically collected from GitHub and various package managers. If you would like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml file. Contributions are very welcome!
Contents
- Machine Learning Frameworks 55 projects
- Data Visualization 50 projects
- Text Data & NLP 90 projects
- Image Data 56 projects
- Graph Data 32 projects
- Audio Data 27 projects
- Geospatial Data 21 projects
- Financial Data 23 projects
- Time Series Data 22 projects
- Medical Data 19 projects
- Tabular Data 3 projects
- Optical Character Recognition 11 projects
- Data Containers & Structures 29 projects
- Data Loading & Extraction 1 project
- Web Scraping & Crawling 1 project
- Data Pipelines & Streaming 41 projects
- Distributed Machine Learning 29 projects
- Hyperparameter Optimization & AutoML 47 projects
- Reinforcement Learning 21 projects
- Recommender Systems 15 projects
- Privacy Machine Learning 6 projects
- Workflow & Experiment Tracking 36 projects
- Model Serialization & Deployment 14 projects
- Model Interpretability 50 projects
- Vector Similarity Search (ANN) 12 projects
- Probabilistics & Statistics 23 projects
- Adversarial Robustness 9 projects
- GPU Utilities 18 projects
- Tensorflow Utilities 15 projects
- Sklearn Utilities 17 projects
- Pytorch Utilities 31 projects
- Database Clients 1 project
- Others 57 projects
Explanation
- 🥇 🥈 🥉 Combined project-quality score
- ⭐️ Star count from GitHub
- 🐣 New project (less than 6 months old)
- 💤 Inactive project (6 months no activity)
- 💀 Dead project (12 months no activity)
- 📈 📉 Project is trending up or down
- ➕ Project was recently added
- ❗️ Warning (e.g. missing/risky license)
- 👨‍💻 Contributors count from GitHub
- 🔀 Fork count from GitHub
- 📋 Issue count from GitHub
- ⏱️ Last update timestamp on package manager
- 📥 Download count from package manager
- 📦 Number of dependent projects
- Tensorflow related project
- Sklearn related project
- PyTorch related project
- MxNet related project
- Apache Spark related project
- Jupyter related project
- PaddlePaddle related project
- Pandas related project
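The activity markers in the legend (🐣 new, 💤 inactive, 💀 dead) amount to simple date thresholds. The following is a minimal sketch of that rule, not the list generator's actual code: the function name `activity_status`, the 30-day month approximation, and the precedence of "dead" over "inactive" over "new" are all assumptions made for illustration.

```python
from datetime import date

# Assumption: a month is approximated as 30 days; the list does not say how
# the thresholds are computed exactly.
DAYS_PER_MONTH = 30


def activity_status(created: date, last_activity: date, today: date) -> str:
    """Return "dead", "inactive", "new", or "" following the legend's thresholds."""
    idle_days = (today - last_activity).days
    age_days = (today - created).days
    if idle_days >= 12 * DAYS_PER_MONTH:  # 💀 12 months no activity
        return "dead"
    if idle_days >= 6 * DAYS_PER_MONTH:   # 💤 6 months no activity
        return "inactive"
    if age_days < 6 * DAYS_PER_MONTH:     # 🐣 less than 6 months old
        return "new"
    return ""


# Hypothetical creation date; the last-activity date matches a project in this
# list whose GitHub timestamp is 08.12.2020, evaluated on 15.07.2021.
print(activity_status(date(2010, 1, 1), date(2020, 12, 8), date(2021, 7, 15)))
```

Passing `today` explicitly rather than calling `date.today()` inside the function keeps the classification reproducible for a given snapshot of the list.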
Machine Learning Frameworks
General-purpose machine learning and deep learning frameworks.
Tensorflow (🥇 44 · ⭐ 160K) - An Open Source Machine Learning Framework for Everyone. Apache-2

- GitHub (👨‍💻 3.7K · 🔀 85K · 📦 150K · 📋 32K - 11% open · ⏱️ 15.07.2021): git clone https://github.com/tensorflow/tensorflow
- PyPi (📥 10M / month · 📦 23K · ⏱️ 13.07.2021): pip install tensorflow
- Conda (📥 2.7M · ⏱️ 30.04.2021): conda install -c conda-forge tensorflow
- Docker Hub (📥 56M · ⭐ 1.9K · ⏱️ 15.07.2021): docker pull tensorflow/tensorflow
scikit-learn (🥇 38 · ⭐ 46K) - scikit-learn: machine learning in Python. BSD-3

XGBoost (🥇 37 · ⭐ 21K) - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or.. Apache-2
LightGBM (🥇 36 · ⭐ 13K) - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT,.. MIT
pytorch-lightning (🥈 34 · ⭐ 15K) - The lightweight PyTorch wrapper for high-performance.. Apache-2

Theano (🥈 34 · ⭐ 9.4K) - Theano was a Python library that allows you to define, optimize, and.. BSD-3
StatsModels (🥈 33 · ⭐ 6.4K) - Statsmodels: statistical modeling and econometrics in Python. BSD-3
Thinc (🥈 32 · ⭐ 2.3K) - A refreshing functional take on deep learning, compatible with your favorite.. MIT
PaddlePaddle (🥈 31 · ⭐ 16K) - PArallel Distributed Deep LEarning: Machine Learning.. Apache-2

Vowpal Wabbit (🥈 30 · ⭐ 7.6K) - Vowpal Wabbit is a machine learning system which pushes the.. BSD-3
tensorpack (🥉 29 · ⭐ 6K) - A Neural Net Training Interface on TensorFlow, with focus on.. Apache-2

Jina (🥉 28 · ⭐ 7.6K) - Cloud-native neural search framework for any kind of data. Apache-2
- GitHub (👨‍💻 110 · 🔀 1K · 📦 110 · 📋 900 - 6% open · ⏱️ 15.07.2021): git clone https://github.com/jina-ai/jina
- PyPi (📥 30K / month · ⏱️ 15.07.2021): pip install jina
- Docker Hub (📥 740K · ⭐ 4 · ⏱️ 15.07.2021): docker pull jinaai/jina
Flax (🥉 27 · ⭐ 1.9K · 📉 ) - Flax is a neural network library for JAX that is designed for.. Apache-2 jax
Turi Create (🥉 26 · ⭐ 10K) - Turi Create simplifies the development of custom machine learning.. BSD-3
Neural Network Libraries (🥉 25 · ⭐ 2.5K) - Neural Network Libraries. Apache-2
tensorflow-upstream (🥉 25 · ⭐ 560) - TensorFlow ROCm port. Apache-2

SHOGUN (🥉 22 · ⭐ 2.8K · 💤 ) - Unified and efficient Machine Learning. BSD-3
- GitHub (👨‍💻 250 · 🔀 1K · 📋 1.5K - 29% open · ⏱️ 08.12.2020): git clone https://github.com/shogun-toolbox/shogun
- Conda (📥 100K · ⏱️ 25.06.2018): conda install -c conda-forge shogun
- Docker Hub (📥 1.5K · ⭐ 1 · ⏱️ 31.01.2019): docker pull shogun/shogun
mace (🥉 21 · ⭐ 4.4K) - MACE is a deep learning inference framework optimized for mobile.. Apache-2
- GitHub (👨‍💻 63 · 🔀 770 · 📥 1.4K · 📋 650 - 6% open · ⏱️ 15.07.2021): git clone https://github.com/XiaoMi/mace
Neural Tangents (🥉 19 · ⭐ 1.5K) - Fast and Easy Infinite Neural Networks in Python. Apache-2
ThunderSVM (🥉 19 · ⭐ 1.3K) - ThunderSVM: A Fast SVM Library on GPUs and CPUs. Apache-2
Haiku (🥉 19 · ⭐ 1.2K) - JAX-based neural network library. Apache-2
- GitHub (👨‍💻 43 · 🔀 86 · 📦 120 · 📋 96 - 23% open · ⏱️ 15.07.2021): git clone https://github.com/deepmind/dm-haiku
Torchbearer (🥉 19 · ⭐ 600) - torchbearer: A model fitting library for PyTorch. MIT

NeoML (🥉 16 · ⭐ 630) - Machine learning framework for both deep learning and traditional.. Apache-2
- GitHub (👨‍💻 21 · 🔀 86 · 📋 51 - 60% open · ⏱️ 14.07.2021): git clone https://github.com/neoml-lib/neoml
ThunderGBM (🥉 15 · ⭐ 600) - ThunderGBM: Fast GBDTs and Random Forests on GPUs. Apache-2
Show 11 hidden projects...
- dlib (🥈 32 · ⭐ 10K) - A toolkit for making real world machine learning and data analysis.. ❗️BSL-1.0
- CNTK (🥉 26 · ⭐ 17K · 💀 ) - Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit. MIT
- NuPIC (🥉 24 · ⭐ 6.3K · 💀 ) - Numenta Platform for Intelligent Computing is an implementation.. ❗️AGPL-3.0
- Lasagne (🥉 24 · ⭐ 3.8K · 💀 ) - Lightweight library to build and train neural networks in Theano. MIT
- xLearn (🥉 24 · ⭐ 2.9K · 💀 ) - High performance, easy-to-use, and scalable machine learning (ML).. Apache-2
- neon (🥉 23 · ⭐ 3.9K · 💀 ) - Intel Nervana reference deep learning framework committed to best.. Apache-2
- NeuPy (🥉 23 · ⭐ 690 · 💀 ) - NeuPy is a Tensorflow based python library for prototyping and building.. MIT
- MindsDB (🥉 20 · ⭐ 3.8K) - Predictive AI layer for existing databases. ❗️GPL-3.0
- chefboost (🥉 20 · ⭐ 260) - A Lightweight Decision Tree Framework supporting regular algorithms:.. MIT
- elegy (🥉 17 · ⭐ 230) - Elegy is a framework-agnostic Trainer interface for the Jax.. Apache-2 jax
- StarSpace (🥉 13 · ⭐ 3.6K · 💀 ) - Learning embeddings for classification, retrieval and ranking. MIT
Data Visualization
General-purpose and task-specific data visualization libraries.
Matplotlib (🥇 42 · ⭐ 14K) - matplotlib: plotting with Python. Python-2.0
Plotly (🥇 36 · ⭐ 9.8K) - The interactive graphing library for Python (includes Plotly Express). MIT
- GitHub (👨‍💻 180 · 🔀 1.8K · 📦 5 · 📋 2K - 45% open · ⏱️ 28.06.2021): git clone https://github.com/plotly/plotly.py
- PyPi (📥 5.7M / month · 📦 5K · ⏱️ 28.06.2021): pip install plotly
- Conda (📥 1.6M · ⏱️ 28.06.2021): conda install -c conda-forge plotly
- NPM (📥 45K / month · 📦 4 · ⏱️ 12.01.2021): npm install plotlywidget
dash (🥇 33 · ⭐ 15K) - Analytical Web Apps for Python, R, Julia, and Jupyter. No JavaScript Required. MIT
pandas-profiling (🥈 32 · ⭐ 7.6K) - Create HTML profiling reports from pandas DataFrame.. MIT


HoloViews (🥈 29 · ⭐ 1.9K) - With Holoviews, your data visualizes itself. BSD-3

- GitHub (👨‍💻 110 · 🔀 320 · 📋 2.6K - 28% open · ⏱️ 13.07.2021): git clone https://github.com/holoviz/holoviews
- PyPi (📥 130K / month · 📦 170 · ⏱️ 22.05.2021): pip install holoviews
- Conda (📥 530K · ⏱️ 23.05.2021): conda install -c conda-forge holoviews
- NPM (📥 5K / month · ⏱️ 24.05.2020): npm install @pyviz/jupyterlab_pyviz
bqplot (🥈 28 · ⭐ 3.1K) - Plotting library for IPython/Jupyter notebooks. Apache-2

- GitHub (👨‍💻 53 · 🔀 410 · 📦 26 · 📋 540 - 35% open · ⏱️ 15.07.2021): git clone https://github.com/bqplot/bqplot
- PyPi (📥 64K / month · 📦 110 · ⏱️ 08.06.2021): pip install bqplot
- Conda (📥 680K · ⏱️ 08.06.2021): conda install -c conda-forge bqplot
- NPM (📥 12K / month · 📦 10 · ⏱️ 08.06.2021): npm install bqplot
datashader (🥈 28 · ⭐ 2.5K) - Quickly and accurately render even the largest data. BSD-3
data-validation (🥈 28 · ⭐ 560) - Library for exploring and validating machine learning.. Apache-2


Perspective (🥈 27 · ⭐ 3.4K) - Streaming pivot visualization via WebAssembly. Apache-2

Facets Overview (🥉 26 · ⭐ 6.6K) - Visualizations for machine learning datasets. Apache-2

HyperTools (🥉 26 · ⭐ 1.7K) - A Python toolbox for gaining geometric insights into high-dimensional.. MIT
D-Tale (🥉 25 · ⭐ 2.5K) - Visualizer for pandas data structures. ❗️LGPL-2.1


pythreejs (🥉 25 · ⭐ 740) - A Jupyter - Three.js bridge. BSD-3

- GitHub (👨‍💻 27 · 🔀 160 · 📦 17 · 📋 200 - 30% open · ⏱️ 26.02.2021): git clone https://github.com/jupyter-widgets/pythreejs
- PyPi (📥 36K / month · 📦 26 · ⏱️ 26.02.2021): pip install pythreejs
- Conda (📥 320K · ⏱️ 02.03.2021): conda install -c conda-forge pythreejs
- NPM (📥 7K / month · 📦 8 · ⏱️ 26.02.2021): npm install jupyter-threejs
hvPlot (🥉 25 · ⭐ 410) - A high-level plotting API for pandas, dask, xarray, and networkx built on.. BSD-3
python-ternary (🥉 24 · ⭐ 440) - Ternary plotting library for python with matplotlib. MIT
Chartify (🥉 23 · ⭐ 2.9K) - Python library that makes it easy for data scientists to create.. Apache-2
Multicore-TSNE (🥉 23 · ⭐ 1.6K · 💤 ) - Parallel t-SNE implementation with Python and Torch.. BSD-3

Pandas-Bokeh (🥉 23 · ⭐ 700) - Bokeh Plotting Backend for Pandas and GeoPandas. MIT

Sweetviz (🥉 21 · ⭐ 1.7K) - Visualize and compare datasets, target values and associations, with one.. MIT
AutoViz (🥉 21 · ⭐ 390) - Automatically Visualize any dataset, any size with a single line of.. Apache-2
animatplot (🥉 18 · ⭐ 380 · 💤 ) - A python package for animating plots built on matplotlib. MIT
Show 8 hidden projects...
- plotnine (🥈 28 · ⭐ 2.7K) - A grammar of graphics for Python. ❗️GPL-2.0
- cartopy (🥈 27 · ⭐ 890) - Cartopy - a cartographic python library with matplotlib support. ❗️LGPL-3.0
- pivottablejs (🥉 21 · ⭐ 440 · 💀 ) - Dragndrop Pivot Tables and Charts for Jupyter/IPython.. MIT
- ivis (🥉 20 · ⭐ 240) - Dimensionality reduction in very large datasets using Siamese.. Apache-2
- pdvega (🥉 16 · ⭐ 340 · 💀 ) - Interactive plotting for Pandas using Vega-Lite. MIT
- nx-altair (🥉 16 · ⭐ 170 · 💀 ) - Draw interactive NetworkX graphs with Altair. MIT
- data-describe (🥉 15 · ⭐ 280) - datadescribe: Pythonic EDA Accelerator for Data Science. Apache-2
- nptsne (🥉 14 · ⭐ 25) - nptsne is a numpy compatible python binary package that offers a number.. Apache-2
Text Data & NLP
Libraries for processing, cleaning, manipulating, and analyzing text data as well as libraries for NLP tasks such as language detection, fuzzy matching, classification, seq2seq learning, conversational AI, keyword extraction, and translation.
transformers (🥇 36 · ⭐ 49K) - Transformers: State-of-the-art Natural Language.. Apache-2


gensim (🥇 36 · ⭐ 12K) - Topic Modelling for Humans. ❗️LGPL-2.1
nltk (🥇 35 · ⭐ 10K) - Suite of libraries and programs for symbolic and statistical natural.. Apache-2
flair (🥇 32 · ⭐ 11K) - A very simple framework for state-of-the-art Natural Language Processing.. MIT

ChatterBot (🥇 31 · ⭐ 11K) - ChatterBot is a machine learning, conversational dialog engine for.. BSD-3
sentencepiece (🥇 31 · ⭐ 5.2K) - Unsupervised text tokenizer for Neural Network-based text.. Apache-2
TextBlob (🥈 30 · ⭐ 7.7K) - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech.. MIT
sentence-transformers (🥈 30 · ⭐ 5.6K) - Multilingual Sentence & Image Embeddings with BERT. Apache-2

snowballstemmer (🥈 30 · ⭐ 500) - Snowball compiler and stemming algorithms. BSD-3
DeepPavlov (🥈 28 · ⭐ 5.3K) - An open source library for deep learning end-to-end dialog.. Apache-2

Tokenizers (🥈 28 · ⭐ 4.7K) - Fast State-of-the-Art Tokenizers optimized for Research and.. Apache-2
TensorFlow Text (🥈 27 · ⭐ 770) - Making text a first-class citizen in TensorFlow. Apache-2

vaderSentiment (🥈 26 · ⭐ 3.1K) - VADER Sentiment Analysis. VADER (Valence Aware Dictionary and.. MIT
haystack (🥈 26 · ⭐ 2.1K) - End-to-end Python framework for building natural language search.. Apache-2
textgenrnn (🥈 25 · ⭐ 4.5K · 💤 ) - Easily train your own text-generating neural network of any.. MIT

neuralcoref (🥈 25 · ⭐ 2.3K) - Fast Coreference Resolution in spaCy with Neural Networks. MIT
TextDistance (🥈 25 · ⭐ 2K) - Compute distance between sequences. 30+ algorithms, pure python.. MIT
scattertext (🥈 25 · ⭐ 1.6K) - Beautiful visualizations of how language differs among document.. Apache-2
spacy-transformers (🥈 25 · ⭐ 980) - Use pretrained transformers like BERT, XLNet and GPT-2.. MIT spacy
Ciphey (🥉 24 · ⭐ 7.4K) - Automatically decrypt encryptions without knowing the key or cipher,.. MIT
- GitHub (👨‍💻 45 · 🔀 420 · 📋 260 - 22% open · ⏱️ 14.07.2021): git clone https://github.com/Ciphey/Ciphey
- PyPi (📥 8.3K / month · ⏱️ 06.06.2021): pip install ciphey
- Docker Hub (📥 11K · ⭐ 4 · ⏱️ 06.06.2021): docker pull remnux/ciphey
fastNLP (🥉 24 · ⭐ 2.2K) - fastNLP: A Modularized and Extensible NLP Framework. Currently still.. Apache-2
pytorch-nlp (🥉 24 · ⭐ 1.9K) - Basic Utilities for PyTorch Natural Language Processing (NLP). BSD-3

DeepMatcher (🥉 23 · ⭐ 3.7K) - Python package for performing Entity and Text Matching using Deep.. BSD-3
PyTextRank (🥉 23 · ⭐ 1.6K) - Python implementation of TextRank for phrase extraction and.. MIT
SciSpacy (🥉 23 · ⭐ 960) - A full spaCy pipeline and models for scientific/biomedical documents. Apache-2
english-words (🥉 22 · ⭐ 5.3K · 💤 ) - A text file containing 479k English words for all your.. Unlicense
gpt-2-simple (🥉 22 · ⭐ 2.7K) - Python package to easily retrain OpenAI's GPT-2 text-.. MIT

Texar (🥉 22 · ⭐ 2.2K · 💤 ) - Toolkit for Machine Learning, Natural Language Processing, and.. Apache-2

pySBD (🥉 22 · ⭐ 330) - pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence.. MIT
Texthero (🥉 21 · ⭐ 2.3K) - Text preprocessing, representation and visualization from zero to hero. MIT
NLP Architect (🥉 20 · ⭐ 2.7K) - A model library for exploring state-of-the-art deep learning.. Apache-2
DELTA (🥉 20 · ⭐ 1.4K · 💤 ) - DELTA is a deep learning based natural language and speech.. Apache-2

- GitHub (👨‍💻 41 · 🔀 290 · 📋 76 - 6% open · ⏱️ 17.12.2020): git clone https://github.com/Delta-ML/delta
- PyPi (📥 27 / month · ⏱️ 27.03.2020): pip install delta-nlp
- Docker Hub (📥 13K · ⏱️ 14.07.2021): docker pull zh794390558/delta
YouTokenToMe (🥉 20 · ⭐ 750) - Unsupervised text tokenizer focused on computational efficiency. MIT
lightseq (🥉 18 · ⭐ 1.3K) - LightSeq: A High Performance Library for Sequence Processing and.. Apache-2
nboost (🥉 18 · ⭐ 580 · 💤 ) - NBoost is a scalable, search-api-boosting platform for deploying.. Apache-2
OpenNRE (🥉 15 · ⭐ 3.2K) - An Open-Source Package for Neural Relation Extraction (NRE). MIT
- GitHub (👨‍💻 9 · 🔀 860 · 📋 330 - 6% open · ⏱️ 31.05.2021): git clone https://github.com/thunlp/OpenNRE
BLINK (🥉 12 · ⭐ 670) - Entity Linker solution. MIT
- GitHub (👨‍💻 16 · 🔀 110 · 📋 62 - 54% open · ⏱️ 02.04.2021): git clone https://github.com/facebookresearch/BLINK
Show 19 hidden projects...
- fuzzywuzzy (🥇 31 · ⭐ 8.3K) - Fuzzy String Matching in Python. ❗️GPL-2.0
- langid (🥈 26 · ⭐ 1.8K · 💀 ) - Stand-alone language identification system. BSD-3
- polyglot (🥈 25 · ⭐ 1.9K · 💤 ) - Multilingual text (NLP) processing toolkit. ❗️GPL-3.0
- flashtext (🥉 23 · ⭐ 4.9K · 💀 ) - Extract Keywords from sentence or Replace keywords in sentences. MIT
- stop-words (🥉 22 · ⭐ 130 · 💀 ) - Get list of common stop words in various languages in Python. BSD-3
- NeuroNER (🥉 19 · ⭐ 1.6K · 💀 ) - Named-entity recognition using neural networks. Easy-to-use and.. MIT
- pyfasttext (🥉 19 · ⭐ 230 · 💀 ) - Yet another Python binding for fastText. ❗️GPL-3.0
- textpipe (🥉 18 · ⭐ 290) - Textpipe: clean and extract metadata from text. MIT
- textaugment (🥉 18 · ⭐ 150) - TextAugment: Text Augmentation Library. MIT
- TextBox (🥉 16 · ⭐ 280) - TextBox is an open-source library for building text generation system. MIT
- Headliner (🥉 16 · ⭐ 230 · 💀 ) - Easy training and deployment of seq2seq models. MIT
- skift (🥉 16 · ⭐ 220) - scikit-learn wrappers for Python fastText. MIT
- TransferNLP (🥉 15 · ⭐ 290 · 💀 ) - NLP library designed for reproducible experimentation.. MIT
- NeuralQA (🥉 15 · ⭐ 200 · 💤 ) - NeuralQA: A Usable Library for Question Answering on Large Datasets.. MIT
- ONNX-T5 (🥉 15 · ⭐ 170) - Summarization, translation, sentiment-analysis, text-generation and.. Apache-2
- textvec (🥉 14 · ⭐ 170 · 💤 ) - Text vectorization tool to outperform TFIDF for classification.. MIT
- fastT5 (🥉 13 · ⭐ 180 · 🐣 ) - boost inference speed of T5 models by 5x & reduce the model size.. Apache-2
- numerizer (🥉 13 · ⭐ 120) - A Python module to convert natural language numerics into ints and.. MIT
- spacy-dbpedia-spotlight (🥉 12 · ⭐ 37) - A spaCy wrapper for DBpedia Spotlight. MIT spacy
Image Data
Libraries for image & video processing, manipulation, and augmentation as well as libraries for computer vision tasks such as facial recognition, object detection, and classification.
torchvision (🥇 36 · ⭐ 9.4K) - Datasets, Transforms and Models specific to Computer Vision. BSD-3

scikit-image (🥇 33 · ⭐ 4.4K) - Image processing in Python. BSD-2
Albumentations (🥇 31 · ⭐ 8.4K) - Fast image augmentation library and an easy-to-use wrapper.. MIT

opencv-python (🥇 31 · ⭐ 2.1K) - Automated CI toolchain to produce precompiled opencv-python,.. MIT
Face Recognition (🥈 29 · ⭐ 41K · 📉 ) - The world's simplest facial recognition api for.. MIT

detectron2 (🥈 29 · ⭐ 17K) - Detectron2 is FAIR's next-generation platform for object.. Apache-2

PyTorch Image Models (🥈 29 · ⭐ 12K) - PyTorch image models, scripts, pretrained weights --.. Apache-2

- GitHub (👨‍💻 52 · 🔀 1.8K · 📥 480K · 📦 790 · 📋 350 - 11% open · ⏱️ 13.07.2021): git clone https://github.com/rwightman/pytorch-image-models
InsightFace (🥈 29 · ⭐ 9.6K) - Face Analysis Project on PyTorch and MXNet. MIT

imageai (🥈 27 · ⭐ 6.3K) - A python library built to empower developers to build applications and.. MIT
MMDetection (🥈 26 · ⭐ 16K) - OpenMMLab Detection Toolbox and Benchmark. Apache-2

- GitHub (👨‍💻 240 · 🔀 5.5K · 📦 77 · 📋 4.1K - 8% open · ⏱️ 13.07.2021): git clone https://github.com/open-mmlab/mmdetection
facenet-pytorch (🥈 26 · ⭐ 2.2K) - Pretrained Pytorch face detection (MTCNN) and facial.. MIT

Face Alignment (🥉 24 · ⭐ 5.1K) - 2D and 3D Face alignment library built using pytorch. BSD-3

CellProfiler (🥉 24 · ⭐ 590) - An open-source application for biological image analysis. BSD-3
Image Super-Resolution (🥉 23 · ⭐ 2.9K) - Super-scale your images and run experiments with.. Apache-2

- GitHub (👨‍💻 10 · 🔀 550 · 📦 58 · 📋 180 - 41% open · ⏱️ 02.06.2021): git clone https://github.com/idealo/image-super-resolution
- PyPi (📥 5K / month · 📦 8 · ⏱️ 08.01.2020): pip install ISR
- Docker Hub (📥 150 · ⏱️ 01.04.2019): docker pull idealo/image-super-resolution-gpu
Torch Points 3D (🥉 23 · ⭐ 1.4K) - Pytorch framework for doing deep learning on point clouds. BSD-3

Image Deduplicator (🥉 22 · ⭐ 3.7K · 💤 ) - Finding duplicate images made easy!. Apache-2

tensorflow-graphics (🥉 22 · ⭐ 2.5K) - TensorFlow Graphics: Differentiable Graphics Layers.. Apache-2

layout-parser (🥉 22 · ⭐ 2.2K) - A unified toolkit for Deep Learning Based Document Image.. Apache-2
vidgear (🥉 22 · ⭐ 1.8K) - High-performance cross-platform Video Processing Python framework.. Apache-2
Classy Vision (🥉 22 · ⭐ 1.3K) - An end-to-end PyTorch framework for image and video.. MIT

vit-pytorch (🥉 21 · ⭐ 5K) - Implementation of Vision Transformer, a simple way to achieve.. MIT

image-match (🥉 19 · ⭐ 2.6K) - Quickly search over billions of images. Apache-2
Norfair (🥉 19 · ⭐ 1.1K) - Lightweight Python library for adding real-time 2D object tracking to.. BSD-3
Caer (🥉 19 · ⭐ 520 · 📉 ) - A lightweight Computer Vision library. Scale your models, not boilerplate. MIT
PaddleDetection (🥉 18 · ⭐ 4.3K) - Object detection and instance segmentation toolkit.. Apache-2

- GitHub (👨‍💻 63 · 🔀 1.1K · 📦 5 · 📋 2.1K - 21% open · ⏱️ 14.07.2021): git clone https://github.com/PaddlePaddle/PaddleDetection
pytorchvideo (🥉 18 · ⭐ 1.7K · 🐣 ) - A deep learning library for video understanding.. Apache-2

pycls (🥉 17 · ⭐ 1.6K) - Codebase for Image Classification Research, written in PyTorch. MIT

- GitHub (👨‍💻 13 · 🔀 180 · 📦 3 · 📋 67 - 25% open · ⏱️ 09.07.2021): git clone https://github.com/facebookresearch/pycls
DE⫶TR (🥉 16 · ⭐ 7.2K) - End-to-End Object Detection with Transformers. Apache-2

- GitHub (👨‍💻 21 · 🔀 1.2K · 📋 350 - 27% open · ⏱️ 30.06.2021): git clone https://github.com/facebookresearch/detr
PySlowFast (🥉 16 · ⭐ 3.9K) - PySlowFast: video understanding codebase from FAIR for.. Apache-2

- GitHub (👨‍💻 24 · 🔀 760 · 📦 4 · 📋 430 - 48% open · ⏱️ 08.07.2021): git clone https://github.com/facebookresearch/SlowFast
Show 8 hidden projects...
- imgaug (🥇 32 · ⭐ 11K · 💀 ) - Image augmentation for machine learning experiments. MIT
- glfw (🥈 30 · ⭐ 7.8K) - A multi-platform library for OpenGL, OpenGL ES, Vulkan, window and input. ❗️Zlib
- Augmentor (🥉 25 · ⭐ 4.5K · 💀 ) - Image augmentation library in Python for machine learning. MIT
- chainercv (🥉 25 · ⭐ 1.5K · 💀 ) - ChainerCV: a Library for Deep Learning in Computer Vision. MIT
- Pillow-SIMD (🥉 24 · ⭐ 1.6K · 💀 ) - The friendly PIL fork. ❗️PIL
- segmentation_models (🥉 23 · ⭐ 3.3K · 💀 ) - Segmentation models with pretrained backbones. Keras.. MIT
- Luminoth (🥉 22 · ⭐ 2.4K · 💀 ) - Deep Learning toolkit for Computer Vision. BSD-3
- solt (🥉 17 · ⭐ 250 · 💀 ) - Streaming over lightweight data transformations. MIT
Graph Data
Libraries for graph processing, clustering, embedding, and machine learning tasks.
PyTorch Geometric (🥇 29 · ⭐ 12K) - Geometric Deep Learning Extension Library for PyTorch. MIT

dgl (🥇 28 · ⭐ 7.6K) - Python package built to ease deep learning on graph, on top of existing.. Apache-2
StellarGraph (🥈 26 · ⭐ 2K) - StellarGraph - Machine Learning on Graphs. Apache-2

AmpliGraph (🥈 23 · ⭐ 1.6K) - Python library for Representation Learning on Knowledge.. Apache-2

pygraphistry (🥈 23 · ⭐ 1.4K) - PyGraphistry is a Python library to quickly load, shape,.. BSD-3

PyTorch-BigGraph (🥈 22 · ⭐ 2.8K) - Generate embeddings from large-scale graph-structured.. BSD-3

torch-cluster (🥈 22 · ⭐ 390) - PyTorch Extension Library of Optimized Graph Cluster.. MIT

Paddle Graph Learning (🥉 19 · ⭐ 1.1K) - Paddle Graph Learning (PGL) is an efficient and.. Apache-2

pytorch_geometric_temporal (🥉 19 · ⭐ 870) - A Temporal Extension Library for PyTorch Geometric. MIT

graph-nets (🥉 18 · ⭐ 4.9K · 💤 ) - Build Graph Nets in Tensorflow. Apache-2

GraphEmbedding (🥉 16 · ⭐ 2.1K · 💤 ) - Implementation and experiments of graph embedding.. MIT

- GitHub (👨‍💻 8 · 🔀 640 · 📦 12 · 📋 49 - 71% open · ⏱️ 18.10.2020): git clone https://github.com/shenweichen/GraphEmbedding
OpenKE (🥉 13 · ⭐ 2.6K) - An Open-Source Package for Knowledge Embedding (KE). MIT
- GitHub (👨‍💻 10 · 🔀 800 · 📋 300 - 22% open · ⏱️ 06.04.2021): git clone https://github.com/thunlp/OpenKE
GraphVite (🥉 12 · ⭐ 920) - GraphVite: A General and High-performance Graph Embedding System. Apache-2
Show 10 hidden projects...
- igraph (🥇 28 · ⭐ 840) - Python interface for igraph. ❗️GPL-2.0
- pygal (🥈 26 · ⭐ 2.4K) - PYthon svg GrAph plotting Library. ❗️LGPL-3.0
- Karate Club (🥈 23 · ⭐ 1.3K) - Karate Club: An API Oriented Open-source Python Framework for.. ❗️GPL-3.0
- DeepWalk (🥉 21 · ⭐ 2.3K · 💀 ) - DeepWalk - Deep Learning for Graphs. ❗️GPL-3.0
- DIG (🥉 17 · ⭐ 780) - A library for graph deep learning research. ❗️GPL-3.0
- Sematch (🥉 17 · ⭐ 360 · 💀 ) - semantic similarity framework for knowledge graph. Apache-2
- DeepGraph (🥉 17 · ⭐ 240) - Analyze Data with Pandas-based Networks. Documentation:. BSD-3
- pyRDF2Vec (🥉 17 · ⭐ 120) - Python Implementation and Extension of RDF2Vec. MIT
- GraphSAGE (🥉 14 · ⭐ 2.4K · 💀 ) - Representation learning on large graphs using stochastic.. MIT
- OpenNE (🥉 13 · ⭐ 1.5K · 💀 ) - An Open-Source Package for Network Embedding (NE). MIT
Audio Data
Libraries for audio analysis, manipulation, transformation, and extraction, as well as speech recognition and music generation tasks.
DeepSpeech (🥇 30 · ⭐ 18K · 📈 ) - DeepSpeech is an open source embedded (offline, on-.. MPL-2.0

torchaudio (🥇 30 · ⭐ 1.4K) - Data manipulation and transformation for audio signal.. BSD-2

pyAudioAnalysis (🥈 27 · ⭐ 4K) - Python Audio Analysis Library: Feature Extraction,.. Apache-2
python-soundfile (🥈 25 · ⭐ 390 · 💤 ) - SoundFile is an audio library based on libsndfile, CFFI,.. BSD-3
python_speech_features (🥉 24 · ⭐ 1.9K · 💤 ) - This library provides common speech features for ASR.. MIT
speechbrain (🥉 22 · ⭐ 2.6K) - A PyTorch-based Speech Toolkit. Apache-2

audiomentations (🥉 22 · ⭐ 620) - A Python library for audio data augmentation. Inspired by.. MIT
tinytag (🥉 22 · ⭐ 470) - Read music meta data and length of MP3, OGG, OPUS, MP4, M4A, FLAC, WMA and.. MIT
TTS (🥉 19 · ⭐ 4.9K) - Deep learning for Text to Speech (Discussion forum:.. MPL-2.0
- GitHub (👨‍💻 56 · 🔀 800 · 📥 820 · 📋 510 - 3% open · ⏱️ 12.02.2021): git clone https://github.com/mozilla/TTS
Show 7 hidden projects...
- SpeechRecognition (🥇 30 · ⭐ 5.7K · 💀 ) - Speech recognition module for Python, supporting.. BSD-3
- aubio (🥈 26 · ⭐ 2.2K) - a library for audio and music analysis. ❗️GPL-3.0
- Essentia (🥉 24 · ⭐ 1.9K) - C++ library for audio and music analysis, description and.. ❗️AGPL-3.0
- Madmom (🥉 22 · ⭐ 770 · 💀 ) - Python audio and music signal processing library. BSD-3
- Dejavu (🥉 21 · ⭐ 5.5K · 💀 ) - Audio fingerprinting and recognition in Python. MIT
- Muda (🥉 19 · ⭐ 190) - A library for augmenting annotated audio data. ISC
- Julius (🥉 17 · ⭐ 200) - Fast PyTorch based DSP for audio and 1D signals. MIT
Geospatial Data
Libraries to load, process, analyze, and write geographic data as well as libraries for spatial analysis, map visualization, and geocoding.
pydeck (🥇 34 · ⭐ 8.8K) - WebGL2 powered geospatial visualization layers. MIT

- GitHub (👨‍💻 170 · 🔀 1.6K · 📦 1.6K · 📋 2.2K - 4% open · ⏱️ 13.07.2021): git clone https://github.com/visgl/deck.gl
- PyPi (📥 350K / month · 📦 2 · ⏱️ 13.04.2021): pip install pydeck
- Conda (📥 41K · ⏱️ 13.04.2021): conda install -c conda-forge pydeck
- NPM (📥 210K / month · 📦 560 · ⏱️ 06.07.2021): npm install deck.gl
ipyleaflet (🥉 29 · ⭐ 1.2K) - A Jupyter - Leaflet.js bridge. MIT

- GitHub (👨‍💻 68 · 🔀 300 · 📦 940 · 📋 430 - 38% open · ⏱️ 15.07.2021): git clone https://github.com/jupyter-widgets/ipyleaflet
- PyPi (📥 48K / month · 📦 98 · ⏱️ 17.06.2021): pip install ipyleaflet
- Conda (📥 720K · ⏱️ 17.06.2021): conda install -c conda-forge ipyleaflet
- NPM (📥 25K / month · 📦 2 · ⏱️ 17.06.2021): npm install jupyter-leaflet
ArcGIS API (🥉 24 · ⭐ 1.1K) - Documentation and samples for ArcGIS API for Python. Apache-2
- GitHub (👨‍💻 70 · 🔀 780 · 📋 370 - 34% open · ⏱️ 06.07.2021): git clone https://github.com/Esri/arcgis-python-api
- PyPi (📥 36K / month · 📦 20 · ⏱️ 08.07.2021): pip install arcgis
- Docker Hub (📥 4.4K · ⭐ 33 · ⏱️ 06.03.2020): docker pull esridocker/arcgis-api-python-notebook
Show 7 hidden projects...
- Geocoder (🥈 30 · ⭐ 1.4K · 💀 ) - Python Geocoder. MIT
- Sentinelsat (🥉 23 · ⭐ 640) - Search and download Copernicus Sentinel satellite images. ❗️GPL-3.0
- gmaps (🥉 22 · ⭐ 720 · 💀 ) - Google maps for Jupyter notebooks. BSD-3
- geoplotlib (🥉 21 · ⭐ 920 · 💀 ) - python toolbox for visualizing geographical data and making maps. MIT
- Satpy (🥉 21 · ⭐ 730) - Python package for earth-observing satellite data processing. ❗️GPL-3.0
- EarthPy (🥉 20 · ⭐ 270) - A package built to support working with spatial data using open source.. BSD-3
- pymap3d (🥉 19 · ⭐ 200) - pure-Python (Numpy optional) 3D coordinate conversions for geospace ecef.. BSD-2
Financial Data
Libraries for algorithmic stock/crypto trading, risk analytics, backtesting, technical analysis, and other tasks on financial data.
yfinance (🥇 29 · ⭐ 5.3K) - Yahoo! Finance market data downloader (+faster Pandas Datareader). Apache-2
Alpha Vantage (🥈 26 · ⭐ 3.4K) - A python wrapper for Alpha Vantage API for financial data. MIT
empyrical (🥈 25 · ⭐ 810 · 💤 ) - Common financial risk and performance metrics. Used by zipline.. Apache-2
TensorTrade (🥉 23 · ⭐ 3.3K · 📉 ) - An open source reinforcement learning framework for.. Apache-2
Enigma Catalyst (🥉 23 · ⭐ 2.2K · 💤 ) - An Algorithmic Trading Library for Crypto-Assets in.. Apache-2
stockstats (🥉 23 · ⭐ 810 · 💤 ) - Supply a wrapper ``StockDataFrame`` based on the.. BSD-3
finmarketpy (🥉 21 · ⭐ 2.6K) - Python library for backtesting trading strategies & analyzing.. Apache-2
tf-quant-finance (🥉 20 · ⭐ 2.7K) - High-performance TensorFlow library for quantitative.. Apache-2

Crypto Signals (🥉 19 · ⭐ 3.2K) - Github.com/CryptoSignal - #1 Quant Trading & Technical Analysis.. MIT
- GitHub (👨‍💻 28 · 🔀 840 · 📋 240 - 17% open · ⏱️ 28.06.2021): git clone https://github.com/CryptoSignal/crypto-signal
- Docker Hub (📥 140K · ⭐ 7 · ⏱️ 03.09.2020): docker pull shadowreaver/crypto-signal
Show 7 hidden projects...
- backtrader (🥈 26 · ⭐ 6.9K) - Python Backtesting library for trading strategies. ❗️GPL-3.0
- Alphalens (🥉 24 · ⭐ 2K · 💀 ) - Performance analysis of predictive (alpha) stock factors. Apache-2
- PyAlgoTrade (🥉 23 · ⭐ 3.4K · 💀 ) - Python Algorithmic Trading Library. Apache-2
- FinTA (🥉 23 · ⭐ 1.2K) - Common financial technical indicators implemented in Pandas. ❗️LGPL-3.0
- arch (🥉 23 · ⭐ 750) - ARCH models in Python. ❗️NCSA
- Backtesting.py (🥉 18 · ⭐ 1.5K) - Backtest trading strategies in Python. ❗️AGPL-3.0
- surpriver (🥉 12 · ⭐ 1.3K · 💤 ) - Find big moving stocks before they move using machine.. ❗️GPL-3.0
Time Series Data
Libraries for forecasting, anomaly detection, feature extraction, and machine learning on time-series and sequential data.
Prophet (🥇 29 · ⭐ 13K) - Tool for producing high quality forecasts for time series data that has.. MIT
pmdarima (🥇 28 · ⭐ 930) - A statistical library designed to fill the void in Python's time series.. MIT
STUMPY (🥈 22 · ⭐ 1.8K) - STUMPY is a powerful and scalable Python library for computing a Matrix.. BSD-3
Darts (🥈 22 · ⭐ 1.2K) - A python library for easy manipulation and forecasting of time series. Apache-2
- GitHub (👨‍💻 32 · 🔀 140 · 📦 6 · 📋 110 - 20% open · ⏱️ 09.07.2021): git clone https://github.com/unit8co/darts
- PyPi (📥 4.3K / month · ⏱️ 09.07.2021): pip install u8darts
- Docker Hub (📥 130 · ⏱️ 22.05.2021): docker pull unit8/darts