This curated list contains 900 awesome open-source projects with a total of 3.4M stars, grouped into 34 categories. All projects are ranked by a project-quality score, which is calculated from various metrics collected automatically from GitHub and different package managers. If you would like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!
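As a rough illustration of the ranking described above, the sketch below orders a few entries by their project-quality score. The `Project` class, `parse_count`, and `rank` are hypothetical names, not part of the list's tooling, and breaking ties by star count is an assumption based only on the order in which entries appear in this list.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Project:
    name: str
    score: int  # project-quality score as displayed in the list
    stars: str  # abbreviated star count as displayed, e.g. "170K"


def parse_count(text: str) -> int:
    """Convert an abbreviated count such as '7.4K' or '3.4M' to an integer."""
    multipliers = {"K": 1_000, "M": 1_000_000}
    text = text.strip()
    suffix = text[-1].upper()
    if suffix in multipliers:
        return round(float(text[:-1]) * multipliers[suffix])
    return int(text)


def rank(projects: List[Project]) -> List[Project]:
    # Highest score first. Using the star count as a tie-breaker is an
    # assumption, not a documented rule.
    return sorted(
        projects,
        key=lambda p: (p.score, parse_count(p.stars)),
        reverse=True,
    )


projects = [
    Project("StatsModels", 43, "7.4K"),
    Project("Tensorflow", 55, "170K"),
    Project("jax", 44, "18K"),
    Project("XGBoost", 44, "23K"),
]
ranked_names = [p.name for p in rank(projects)]
print(ranked_names)
```

The score itself is taken as given here; how it is computed from the collected metrics is not specified in this list.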
Contents
- Machine Learning Frameworks 56 projects
- Data Visualization 51 projects
- Text Data & NLP 96 projects
- Image Data 60 projects
- Graph Data 36 projects
- Audio Data 28 projects
- Geospatial Data 22 projects
- Financial Data 25 projects
- Time Series Data 26 projects
- Medical Data 19 projects
- Tabular Data 5 projects
- Optical Character Recognition 12 projects
- Data Containers & Structures 0 projects
- Data Loading & Extraction 2 projects
- Web Scraping & Crawling 1 project
- Data Pipelines & Streaming 43 projects
- Distributed Machine Learning 33 projects
- Hyperparameter Optimization & AutoML 47 projects
- Reinforcement Learning 23 projects
- Recommender Systems 16 projects
- Privacy Machine Learning 6 projects
- Workflow & Experiment Tracking 39 projects
- Model Serialization & Deployment 16 projects
- Model Interpretability 52 projects
- Vector Similarity Search (ANN) 12 projects
- Probabilistics & Statistics 22 projects
- Adversarial Robustness 9 projects
- GPU Utilities 18 projects
- Tensorflow Utilities 15 projects
- Jax Utilities 2 projects
- Sklearn Utilities 17 projects
- Pytorch Utilities 32 projects
- Database Clients 1 project
- Others 61 projects
Explanation
- 🥇 🥈 🥉 Combined project-quality score
- ⭐️ Star count from GitHub
- 🐣 New project (less than 6 months old)
- 💤 Inactive project (6 months no activity)
- 💀 Dead project (12 months no activity)
- 📈 📉 Project is trending up or down
- ➕ Project was recently added
- ❗️ Warning (e.g. missing/risky license)
- 👨💻 Contributors count from GitHub
- 🔀 Fork count from GitHub
- 📋 Issue count from GitHub
- ⏱️ Last update timestamp on package manager
- 📥 Download count from package manager
- 📦 Number of dependent projects
- Tensorflow related project
- Sklearn related project
- PyTorch related project
- MxNet related project
- Apache Spark related project
- Jupyter related project
- PaddlePaddle related project
- Pandas related project
- Jax related project
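The activity markers in the legend (🐣, 💤, 💀) can be read as simple thresholds on project age and time since the last activity. The following is a minimal sketch of that reading, not the list's actual implementation: the function names are hypothetical, counting whole calendar months is an assumption, and so is the precedence (dead, then inactive, then new).

```python
from datetime import date

# Thresholds taken from the legend.
NEW_MONTHS = 6
INACTIVE_MONTHS = 6
DEAD_MONTHS = 12


def months_between(earlier: date, later: date) -> int:
    """Whole calendar months elapsed from `earlier` to `later` (assumed unit)."""
    months = (later.year - earlier.year) * 12 + (later.month - earlier.month)
    if later.day < earlier.day:
        months -= 1  # the current month has not fully elapsed yet
    return max(months, 0)


def activity_marker(created: date, last_activity: date, today: date) -> str:
    """Return the legend marker for a project, or '' if none applies."""
    idle = months_between(last_activity, today)
    if idle >= DEAD_MONTHS:
        return "💀"
    if idle >= INACTIVE_MONTHS:
        return "💤"
    if months_between(created, today) < NEW_MONTHS:
        return "🐣"
    return ""


# Hypothetical creation dates; the reference date is also an assumption.
today = date(2022, 5, 19)
print(activity_marker(date(2019, 1, 1), date(2021, 6, 2), today))
print(activity_marker(date(2018, 1, 1), date(2021, 5, 1), today))
print(activity_marker(date(2022, 1, 10), date(2022, 5, 1), today))
```

Which event counts as "activity" (a commit, a release, an issue update) is not stated in the legend, so `last_activity` is left to the caller.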
Machine Learning Frameworks
General-purpose machine learning and deep learning frameworks.
Tensorflow (🥇 55 · ⭐ 170K) - An Open Source Machine Learning Framework for Everyone. Apache-2

- GitHub (👨💻 4K · 🔀 87K · 📦 200K · 📋 35K - 6% open · ⏱️ 19.05.2022): git clone https://github.com/tensorflow/tensorflow
- PyPi (📥 16M / month · 📦 14K · ⏱️ 16.05.2022): pip install tensorflow
- Conda (📥 3.3M · ⏱️ 18.05.2022): conda install -c conda-forge tensorflow
- Docker Hub (📥 65M · ⭐ 2K · ⏱️ 19.05.2022): docker pull tensorflow/tensorflow
scikit-learn (🥇 51 · ⭐ 50K) - scikit-learn: machine learning in Python. BSD-3

XGBoost (🥇 44 · ⭐ 23K) - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or.. Apache-2
jax (🥇 44 · ⭐ 18K · 📈 ) - Composable transformations of Python+NumPy programs: differentiate,.. Apache-2
pytorch-lightning (🥈 43 · ⭐ 18K · 📈 ) - The lightweight PyTorch wrapper for high-performance.. Apache-2

StatsModels (🥈 43 · ⭐ 7.4K) - Statsmodels: statistical modeling and econometrics in Python. BSD-3
LightGBM (🥈 42 · ⭐ 14K) - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT,.. MIT
PaddlePaddle (🥈 41 · ⭐ 18K) - PArallel Distributed Deep LEarning: Machine Learning.. Apache-2

Catboost (🥈 40 · ⭐ 6.5K) - A fast, scalable, high performance Gradient Boosting on Decision.. Apache-2
Jina (🥈 38 · ⭐ 15K) - Cloud-native neural search framework for any kind of data. Apache-2
- GitHub (👨💻 140 · 🔀 1.9K · 📦 280 · 📋 1.5K - 4% open · ⏱️ 19.05.2022): git clone https://github.com/jina-ai/jina
- PyPi (📥 58K / month · 📦 2 · ⏱️ 19.05.2022): pip install jina
- Conda (📥 4.8K · ⏱️ 22.04.2022): conda install -c conda-forge jina-core
- Docker Hub (📥 1.1M · ⭐ 7 · ⏱️ 19.05.2022): docker pull jinaai/jina
Theano (🥈 38 · ⭐ 9.6K) - Theano was a Python library that allows you to define, optimize, and.. BSD-3
Thinc (🥈 37 · ⭐ 2.5K) - A refreshing functional take on deep learning, compatible with your favorite.. MIT
Vowpal Wabbit (🥈 34 · ⭐ 8K) - Vowpal Wabbit is a machine learning system which pushes the.. BSD-3
tensorflow-upstream (🥈 34 · ⭐ 590) - TensorFlow ROCm port. Apache-2

Turi Create (🥉 32 · ⭐ 11K) - Turi Create simplifies the development of custom machine learning.. BSD-3
tensorpack (🥉 32 · ⭐ 6.2K) - A Neural Net Training Interface on TensorFlow, with focus.. Apache-2

einops (🥉 31 · ⭐ 5K) - Deep learning operations reinvented (for pytorch, tensorflow, jax and others). MIT
Neural Network Libraries (🥉 30 · ⭐ 2.5K) - Neural Network Libraries. Apache-2
Neural Tangents (🥉 26 · ⭐ 1.8K) - Fast and Easy Infinite Neural Networks in Python. Apache-2
mace (🥉 24 · ⭐ 4.6K) - MACE is a deep learning inference framework optimized for mobile.. Apache-2
- GitHub (👨💻 63 · 🔀 800 · 📥 1.4K · 📋 670 - 7% open · ⏱️ 11.02.2022): git clone https://github.com/XiaoMi/mace
Towhee (🥉 23 · ⭐ 480) - A framework that provides a simple API for developing ML-driven data.. Apache-2
ThunderSVM (🥉 20 · ⭐ 1.4K) - ThunderSVM: A Fast SVM Library on GPUs and CPUs. Apache-2
chefboost (🥉 18 · ⭐ 330) - A Lightweight Decision Tree Framework supporting regular algorithms:.. MIT
Show 13 hidden projects...
- dlib (🥈 38 · ⭐ 11K) - A toolkit for making real world machine learning and data analysis.. ❗️BSL-1.0
- TFlearn (🥉 32 · ⭐ 9.6K · 💀 ) - Deep learning library featuring a higher-level API for TensorFlow. MIT
- CNTK (🥉 31 · ⭐ 17K · 💀 ) - Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit. MIT
- Lasagne (🥉 29 · ⭐ 3.8K · 💀 ) - Lightweight library to build and train neural networks in Theano. MIT
- MindsDB (🥉 28 · ⭐ 6.8K) - In-Database Machine Learning. ❗️GPL-3.0
- NuPIC (🥉 28 · ⭐ 6.3K · 💀 ) - Numenta Platform for Intelligent Computing is an implementation.. ❗️AGPL-3.0
- SHOGUN (🥉 26 · ⭐ 2.9K · 💀 ) - Unified and efficient Machine Learning. BSD-3
- xLearn (🥉 25 · ⭐ 3K · 💀 ) - High performance, easy-to-use, and scalable machine learning (ML).. Apache-2
- NeuPy (🥉 25 · ⭐ 710 · 💀 ) - NeuPy is a Tensorflow based python library for prototyping and building.. MIT
- neon (🥉 23 · ⭐ 3.9K · 💀 ) - Intel Nervana reference deep learning framework committed to best.. Apache-2
- Torchbearer (🥉 22 · ⭐ 630 · 💀 ) - torchbearer: A model fitting library for PyTorch. MIT
- ThunderGBM (🥉 16 · ⭐ 620 · 💀 ) - ThunderGBM: Fast GBDTs and Random Forests on GPUs. Apache-2
- StarSpace (🥉 15 · ⭐ 3.8K · 💀 ) - Learning embeddings for classification, retrieval and ranking. MIT
Data Visualization
General-purpose and task-specific data visualization libraries.
Matplotlib (🥇 49 · ⭐ 15K) - matplotlib: plotting with Python. Python-2.0
Plotly (🥇 42 · ⭐ 12K) - The interactive graphing library for Python (includes Plotly Express). MIT
- GitHub (👨💻 200 · 🔀 2.1K · 📦 9 · 📋 2.3K - 49% open · ⏱️ 11.05.2022): git clone https://github.com/plotly/plotly.py
- PyPi (📥 7.6M / month · 📦 4K · ⏱️ 11.05.2022): pip install plotly
- Conda (📥 2.5M · ⏱️ 11.05.2022): conda install -c conda-forge plotly
- npm (📥 45K / month · 📦 4 · ⏱️ 12.01.2021): npm install plotlywidget
dash (🥇 39 · ⭐ 16K) - Analytical Web Apps for Python, R, Julia, and Jupyter. No JavaScript Required. MIT
pandas-profiling (🥈 38 · ⭐ 9K) - Create HTML profiling reports from pandas DataFrame.. MIT


HoloViews (🥈 34 · ⭐ 2.2K) - With Holoviews, your data visualizes itself. BSD-3

- GitHub (👨💻 120 · 🔀 360 · 📋 2.8K - 31% open · ⏱️ 18.05.2022): git clone https://github.com/holoviz/holoviews
- PyPi (📥 350K / month · 📦 210 · ⏱️ 06.05.2022): pip install holoviews
- Conda (📥 710K · ⏱️ 09.05.2022): conda install -c conda-forge holoviews
- npm (📥 1.8K / month · ⏱️ 24.05.2020): npm install @pyviz/jupyterlab_pyviz
datashader (🥈 32 · ⭐ 2.8K) - Quickly and accurately render even the largest data. BSD-3
Perspective (🥈 31 · ⭐ 4.5K) - A data visualization and analytics component, especially.. Apache-2

- GitHub (👨💻 69 · 🔀 470 · 📦 250 · 📋 520 - 16% open · ⏱️ 19.05.2022): git clone https://github.com/finos/perspective
- PyPi (📥 2.7K / month · 📦 9 · ⏱️ 14.03.2022): pip install perspective-python
- Conda (📥 59K · ⏱️ 18.05.2022): conda install -c conda-forge perspective
- npm (📥 2.8K / month · ⏱️ 13.05.2022): npm install @finos/perspective-jupyterlab
bqplot (🥈 31 · ⭐ 3.3K) - Plotting library for IPython/Jupyter notebooks. Apache-2

- GitHub (👨💻 57 · 🔀 460 · 📦 30 · 📋 580 - 38% open · ⏱️ 09.05.2022): git clone https://github.com/bqplot/bqplot
- PyPi (📥 70K / month · 📦 92 · ⏱️ 11.02.2022): pip install bqplot
- Conda (📥 960K · ⏱️ 11.02.2022): conda install -c conda-forge bqplot
- npm (📥 26K / month · 📦 10 · ⏱️ 11.02.2022): npm install bqplot
D-Tale (🥉 30 · ⭐ 3.4K) - Visualizer for pandas data structures. ❗️LGPL-2.1


data-validation (🥉 29 · ⭐ 630) - Library for exploring and validating machine learning.. Apache-2


hvPlot (🥉 29 · ⭐ 560) - A high-level plotting API for pandas, dask, xarray, and networkx built on.. BSD-3
Facets Overview (🥉 27 · ⭐ 6.8K · 💤 ) - Visualizations for machine learning datasets. Apache-2

HyperTools (🥉 26 · ⭐ 1.7K) - A Python toolbox for gaining geometric insights into high-dimensional.. MIT
pythreejs (🥉 26 · ⭐ 810) - A Jupyter - Three.js bridge. BSD-3

- GitHub (👨💻 29 · 🔀 180 · 📦 19 · 📋 220 - 33% open · ⏱️ 06.12.2021): git clone https://github.com/jupyter-widgets/pythreejs
- PyPi (📥 48K / month · 📦 38 · ⏱️ 26.02.2021): pip install pythreejs
- Conda (📥 380K · ⏱️ 02.03.2021): conda install -c conda-forge pythreejs
- npm (📥 5.5K / month · 📦 7 · ⏱️ 26.02.2021): npm install jupyter-threejs
Sweetviz (🥉 23 · ⭐ 2K · 💤 ) - Visualize and compare datasets, target values and associations, with.. MIT
AutoViz (🥉 23 · ⭐ 700) - Automatically Visualize any dataset, any size with a single line of.. Apache-2
Pandas-Bokeh (🥉 22 · ⭐ 780) - Bokeh Plotting Backend for Pandas and GeoPandas. MIT

python-ternary (🥉 22 · ⭐ 550) - Ternary plotting library for python with matplotlib. MIT
Show 14 hidden projects...
- cartopy (🥈 31 · ⭐ 1K) - Cartopy - a cartographic python library with matplotlib support. ❗️LGPL-3.0
- Cufflinks (🥉 29 · ⭐ 2.6K · 💀 ) - Productivity Tools for Plotly + Pandas. MIT
- Multicore-TSNE (🥉 25 · ⭐ 1.7K · 💀 ) - Parallel t-SNE implementation with Python and Torch.. BSD-3
- Chartify (🥉 24 · ⭐ 3.2K · 💀 ) - Python library that makes it easy for data scientists to create.. Apache-2
- pivottablejs (🥉 23 · ⭐ 460 · 💀 ) - Drag'n'drop Pivot Tables and Charts for Jupyter/IPython.. MIT
- PandasGUI (🥉 22 · ⭐ 2.6K) - A GUI for Pandas DataFrames. ❗️MIT-0
- PDPbox (🥉 22 · ⭐ 680 · 💀 ) - python partial dependence plot toolbox. MIT
- Popmon (🥉 21 · ⭐ 300 · ➕ ) - Monitor the stability of a Pandas or Spark dataframe. MIT
- ivis (🥉 19 · ⭐ 260) - Dimensionality reduction in very large datasets using Siamese.. Apache-2
- animatplot (🥉 17 · ⭐ 390 · 💀 ) - A python package for animating plots build on matplotlib. MIT
- pdvega (🥉 16 · ⭐ 340 · 💀 ) - Interactive plotting for Pandas using Vega-Lite. MIT
- data-describe (🥉 16 · ⭐ 290) - datadescribe: Pythonic EDA Accelerator for Data Science. Apache-2
- nx-altair (🥉 16 · ⭐ 190 · 💀 ) - Draw interactive NetworkX graphs with Altair. MIT
- nptsne (🥉 14 · ⭐ 28 · 💀 ) - nptsne is a numpy compatible python binary package that offers a.. Apache-2
Text Data & NLP
Libraries for processing, cleaning, manipulating, and analyzing text data as well as libraries for NLP tasks such as language detection, fuzzy matching, classification, seq2seq learning, conversational AI, keyword extraction, and translation.
transformers (🥇 49 · ⭐ 63K) - Transformers: State-of-the-art Machine Learning for.. Apache-2


nltk (🥇 44 · ⭐ 11K) - Suite of libraries and programs for symbolic and statistical natural.. Apache-2
gensim (🥇 42 · ⭐ 13K) - Topic Modelling for Humans. ❗️LGPL-2.1
flair (🥇 39 · ⭐ 12K) - A very simple framework for state-of-the-art Natural Language Processing.. MIT

ChatterBot (🥇 36 · ⭐ 12K · 💤 ) - ChatterBot is a machine learning, conversational dialog engine.. BSD-3
sentence-transformers (🥈 34 · ⭐ 7.7K) - Multilingual Sentence & Image Embeddings with BERT. Apache-2

sentencepiece (🥈 34 · ⭐ 5.9K) - Unsupervised text tokenizer for Neural Network-based text.. Apache-2
Tokenizers (🥈 34 · ⭐ 5.6K) - Fast State-of-the-Art Tokenizers optimized for Research and.. Apache-2
TensorFlow Text (🥈 33 · ⭐ 940) - Making text a first-class citizen in TensorFlow. Apache-2

DeepPavlov (🥈 31 · ⭐ 5.7K) - An open source library for deep learning end-to-end dialog.. Apache-2

haystack (🥈 31 · ⭐ 4.7K) - Haystack is an open source NLP framework that leverages Transformer.. Apache-2
snowballstemmer (🥈 31 · ⭐ 560) - Snowball compiler and stemming algorithms. BSD-3
SciSpacy (🥈 30 · ⭐ 1.2K) - A full spaCy pipeline and models for scientific/biomedical documents. Apache-2
vaderSentiment (🥈 28 · ⭐ 3.6K) - VADER Sentiment Analysis. VADER (Valence Aware Dictionary and.. MIT
TextDistance (🥈 28 · ⭐ 2.8K) - Compute distance between sequences. 30+ algorithms, pure python.. MIT
neuralcoref (🥈 28 · ⭐ 2.5K · 💤 ) - Fast Coreference Resolution in spaCy with Neural Networks. MIT
PyTextRank (🥈 28 · ⭐ 1.8K) - Python implementation of TextRank algorithms (textgraphs) for phrase.. MIT
Ciphey (🥉 27 · ⭐ 9.9K) - Automatically decrypt encryptions without knowing the key or cipher,.. MIT
- GitHub (👨💻 46 · 🔀 610 · 📋 290 - 17% open · ⏱️ 18.05.2022): git clone https://github.com/Ciphey/Ciphey
- PyPi (📥 14K / month · ⏱️ 06.06.2021): pip install ciphey
- Docker Hub (📥 15K · ⭐ 7 · ⏱️ 14.04.2022): docker pull remnux/ciphey
fastNLP (🥉 27 · ⭐ 2.6K) - fastNLP: A Modularized and Extensible NLP Framework. Currently still.. Apache-2
spacy-transformers (🥉 27 · ⭐ 1.1K) - Use pretrained transformers like BERT, XLNet and GPT-2.. MIT spacy
english-words (🥉 26 · ⭐ 7.1K) - A text file containing 479k English words for all your.. Unlicense
scattertext (🥉 25 · ⭐ 1.8K) - Beautiful visualizations of how language differs among document.. Apache-2
pytorch-nlp (🥉 24 · ⭐ 2.1K · 💤 ) - Basic Utilities for PyTorch Natural Language Processing.. BSD-3

Texthero (🥉 22 · ⭐ 2.5K · 💤 ) - Text preprocessing, representation and visualization from zero to.. MIT
qdrant (🥉 22 · ⭐ 1.6K) - Qdrant - vector similarity search engine with extended filtering.. Apache-2
- GitHub (👨💻 24 · 🔀 92 · 📋 200 - 23% open · ⏱️ 19.05.2022): git clone https://github.com/qdrant/qdrant
rubrix (🥉 22 · ⭐ 1.1K) - Rubrix, open-source framework for data-centric NLP. Data annotation.. Apache-2
DeepMatcher (🥉 21 · ⭐ 4.2K · 💤 ) - Python package for performing Entity and Text Matching using.. BSD-3
gpt-2-simple (🥉 21 · ⭐ 2.9K · 💤 ) - Python package to easily retrain OpenAI's GPT-2 text-.. MIT

NLP Architect (🥉 21 · ⭐ 2.8K · 💤 ) - A model library for exploring state-of-the-art deep.. Apache-2
lightseq (🥉 21 · ⭐ 2.1K) - LightSeq: A High Performance Library for Sequence Processing and.. Apache-2
OpenPrompt (🥉 21 · ⭐ 1.4K) - An Open-Source Framework for Prompt-Learning. Apache-2
OpenNRE (🥉 16 · ⭐ 3.6K) - An Open-Source Package for Neural Relation Extraction (NRE). MIT
- GitHub (👨💻 10 · 🔀 940 · 📋 350 - 5% open · ⏱️ 06.04.2022): git clone https://github.com/thunlp/OpenNRE
Show 28 hidden projects...
- fuzzywuzzy (🥈 32 · ⭐ 8.7K · 💤 ) - Fuzzy String Matching in Python. ❗️GPL-2.0
- langid (🥉 27 · ⭐ 2K · 💀 ) - Stand-alone language identification system. BSD-3
- polyglot (🥉 26 · ⭐ 2K · 💀 ) - Multilingual text (NLP) processing toolkit. ❗️GPL-3.0
- flashtext (🥉 25 · ⭐ 5.2K · 💀 ) - Extract Keywords from sentence or Replace keywords in sentences. MIT
- textgenrnn (🥉 24 · ⭐ 4.7K · 💀 ) - Easily train your own text-generating neural network of any.. MIT
- whoosh (🥉 24 · ⭐ 230) - Pure-Python full-text search library. ❗️BSD-1-Clause
- YouTokenToMe (🥉 23 · ⭐ 800 · 💀 ) - Unsupervised text tokenizer focused on computational efficiency. MIT
- pySBD (🥉 23 · ⭐ 440 · 💀 ) - pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence.. MIT
- Texar (🥉 22 · ⭐ 2.3K · 💀 ) - Toolkit for Machine Learning, Natural Language Processing, and.. Apache-2
- DELTA (🥉 21 · ⭐ 1.5K · 💀 ) - DELTA is a deep learning based natural language and speech.. Apache-2
- anaGo (🥉 21 · ⭐ 1.4K · 💀 ) - Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition,.. MIT
- happy-transformer (🥉 21 · ⭐ 290) - A package built on top of Hugging Face's transformers.. Apache-2 huggingface
- stop-words (🥉 21 · ⭐ 140 · 💀 ) - Get list of common stop words in various languages in Python. BSD-3
- fastT5 (🥉 20 · ⭐ 290) - boost inference speed of T5 models by 5x & reduce the model size by 3x. Apache-2
- pyfasttext (🥉 20 · ⭐ 230 · 💀 ) - Yet another Python binding for fastText. ❗️GPL-3.0
- textpipe (🥉 19 · ⭐ 300 · 💤 ) - Textpipe: clean and extract metadata from text. MIT
- NeuroNER (🥉 17 · ⭐ 1.6K · 💀 ) - Named-entity recognition using neural networks. Easy-to-use and.. MIT
- nboost (🥉 17 · ⭐ 620 · 💀 ) - NBoost is a scalable, search-api-boosting platform for deploying.. Apache-2
- textaugment (🥉 17 · ⭐ 240) - TextAugment: Text Augmentation Library. MIT
- skift (🥉 17 · ⭐ 230) - scikit-learn wrappers for Python fastText. MIT
- BLINK (🥉 15 · ⭐ 880 · 💀 ) - Entity Linker solution. MIT
- NeuralQA (🥉 15 · ⭐ 220 · 💀 ) - NeuralQA: A Usable Library for Question Answering on Large Datasets.. MIT
- Headliner (🥉 14 · ⭐ 230 · 💀 ) - Easy training and deployment of seq2seq models. MIT
- numerizer (🥉 14 · ⭐ 140) - A Python module to convert natural language numerics into ints and.. MIT
- spacy-dbpedia-spotlight (🥉 14 · ⭐ 61) - A spaCy wrapper for DBpedia Spotlight. MIT spacy
- TransferNLP (🥉 13 · ⭐ 290 · 💀 ) - NLP library designed for reproducible experimentation.. MIT
- ONNX-T5 (🥉 13 · ⭐ 200 · 💀 ) - Summarization, translation, sentiment-analysis, text-generation.. Apache-2
- textvec (🥉 13 · ⭐ 180 · 💀 ) - Text vectorization tool to outperform TFIDF for classification.. MIT
Image Data
Libraries for image & video processing, manipulation, and augmentation as well as libraries for computer vision tasks such as facial recognition, object detection, and classification.
scikit-image (🥇 44 · ⭐ 4.9K) - Image processing in Python. BSD-2
torchvision (🥇 42 · ⭐ 12K) - Datasets, Transforms and Models specific to Computer Vision. BSD-3

MMDetection (🥇 37 · ⭐ 20K) - OpenMMLab Detection Toolbox and Benchmark. Apache-2

PyTorch Image Models (🥇 37 · ⭐ 19K) - PyTorch image models, scripts, pretrained weights --.. Apache-2

InsightFace (🥈 34 · ⭐ 12K) - State-of-the-art 2D and 3D Face Analysis Project. MIT

opencv-python (🥈 34 · ⭐ 2.7K) - Automated CI toolchain to produce precompiled opencv-python,.. MIT
Face Recognition (🥈 33 · ⭐ 44K · 💤 ) - The world's simplest facial recognition api for Python.. MIT

detectron2 (🥈 33 · ⭐ 21K) - Detectron2 is a platform for object detection, segmentation.. Apache-2

Albumentations (🥈 33 · ⭐ 10K) - Fast image augmentation library and an easy-to-use wrapper.. MIT

PaddleDetection (🥈 32 · ⭐ 7.7K) - Object Detection toolkit based on PaddlePaddle. It.. Apache-2

imageai (🥈 30 · ⭐ 7K · 💤 ) - A python library built to empower developers to build applications and.. MIT
vit-pytorch (🥈 29 · ⭐ 10K) - Implementation of Vision Transformer, a simple way to achieve.. MIT

Face Alignment (🥉 27 · ⭐ 5.7K · 💤 ) - 2D and 3D Face alignment library build using pytorch. BSD-3

vidgear (🥉 27 · ⭐ 2.2K) - A High-performance cross-platform Video Processing Python framework.. Apache-2
sahi (🥉 27 · ⭐ 1.6K) - A lightweight vision library for performing large scale object detection/.. MIT
layout-parser (🥉 26 · ⭐ 3K) - A Unified Toolkit for Deep Learning Based Document Image.. Apache-2
facenet-pytorch (🥉 25 · ⭐ 2.9K) - Pretrained Pytorch face detection (MTCNN) and facial.. MIT

pytorchvideo (🥉 25 · ⭐ 2.4K) - A deep learning library for video understanding research. Apache-2

tensorflow-graphics (🥉 24 · ⭐ 2.6K) - TensorFlow Graphics: Differentiable Graphics Layers.. Apache-2

icevision (🥉 24 · ⭐ 690) - An Agnostic Computer Vision Framework - Pluggable to any Training.. Apache-2
CellProfiler (🥉 24 · ⭐ 670 · 📉 ) - An open-source application for biological image analysis. BSD-3
deep-daze (🥉 23 · ⭐ 4.2K) - Simple command line tool for text to image generation using OpenAI's.. MIT
Image Super-Resolution (🥉 23 · ⭐ 3.6K · 💤 ) - Super-scale your images and run experiments with.. Apache-2

- GitHub (👨💻 10 · 🔀 620 · 📦 83 · 📋 200 - 44% open · ⏱️ 02.06.2021): git clone https://github.com/idealo/image-super-resolution
- PyPi (📥 4.4K / month · 📦 5 · ⏱️ 08.01.2020): pip install ISR
- Docker Hub (📥 210 · ⏱️ 01.04.2019): docker pull idealo/image-super-resolution-gpu
Classy Vision (🥉 23 · ⭐ 1.5K) - An end-to-end PyTorch framework for image and video.. MIT

Norfair (🥉 22 · ⭐ 1.4K) - Lightweight Python library for adding real-time object tracking to any.. BSD-3
image-match (🥉 21 · ⭐ 2.7K · 💤 ) - Quickly search over billions of images. Apache-2
DE⫶TR (🥉 19 · ⭐ 8.8K) - End-to-End Object Detection with Transformers. Apache-2

- GitHub (👨💻 25 · 🔀 1.6K · 📋 430 - 37% open · ⏱️ 07.03.2022): git clone https://github.com/facebookresearch/detr
scenic (🥉 19 · ⭐ 950) - Scenic: A Jax Library for Computer Vision Research and Beyond. Apache-2

- GitHub (👨💻 36 · 🔀 110 · 📦 16 · 📋 59 - 57% open · ⏱️ 19.05.2022): git clone https://github.com/google-research/scenic
PySlowFast (🥉 18 · ⭐ 4.8K) - PySlowFast: video understanding codebase from FAIR for.. Apache-2

Caer (🥉 17 · ⭐ 620 · 💤 ) - A lightweight Computer Vision library. Scale your models, not boilerplate. MIT
Show 12 hidden projects...
- glfw (🥈 36 · ⭐ 9.1K) - A multi-platform library for OpenGL, OpenGL ES, Vulkan, window and input. ❗️Zlib
- imgaug (🥈 35 · ⭐ 13K · 💀 ) - Image augmentation for machine learning experiments. MIT
- PyTorch3D (🥈 29 · ⭐ 6K) - PyTorch3D is FAIR's library of reusable components for deep.. ❗Unlicensed
- Pillow-SIMD (🥉 28 · ⭐ 1.8K) - The friendly PIL fork. ❗️PIL
- chainercv (🥉 27 · ⭐ 1.5K · 💀 ) - ChainerCV: a Library for Deep Learning in Computer Vision. MIT
- segmentation_models (🥉 25 · ⭐ 3.9K · 💀 ) - Segmentation models with pretrained backbones. Keras.. MIT
- Image Deduplicator (🥉 22 · ⭐ 4K · 💀 ) - Finding duplicate images made easy!. Apache-2
- Luminoth (🥉 21 · ⭐ 2.4K · 💀 ) - Deep Learning toolkit for Computer Vision. BSD-3
- nude.py (🥉 21 · ⭐ 860 · 💀 ) - Nudity detection with Python. MIT
- solt (🥉 16 · ⭐ 250 · 💀 ) - Streaming over lightweight data transformations. MIT
- HugsVision (🥉 15 · ⭐ 160) - HugsVision is a easy to use huggingface wrapper for state-of-the-.. MIT huggingface
- Torch Points 3D (🥉 14 · ⭐ 58 · 🐣 ) - Pytorch framework for doing deep learning on point.. BSD-3
Graph Data
Libraries for graph processing, clustering, embedding, and machine learning tasks.
PyTorch Geometric (🥇 38 · ⭐ 15K · 📈 ) - Graph Neural Network Library for PyTorch. MIT

dgl (🥇 36 · ⭐ 9.6K) - Python package built to ease deep learning on graph, on top of existing.. Apache-2
StellarGraph (🥈 28 · ⭐ 2.4K · 💤 ) - StellarGraph - Machine Learning on Graphs. Apache-2

ogb (🥈 28 · ⭐ 1.3K) - Benchmark datasets, data loaders, and evaluators for graph machine learning. MIT
pygraphistry (🥈 27 · ⭐ 1.6K) - PyGraphistry is a Python library to quickly load, shape,.. BSD-3

Paddle Graph Learning (🥈 27 · ⭐ 1.3K) - Paddle Graph Learning (PGL) is an efficient and.. Apache-2

PyTorch-BigGraph (🥈 25 · ⭐ 3.1K) - Generate embeddings from large-scale graph-structured.. BSD-3

pytorch_geometric_temporal (🥈 25 · ⭐ 1.5K) - PyTorch Geometric Temporal: Spatiotemporal Signal.. MIT

AmpliGraph (🥈 23 · ⭐ 1.8K · 💤 ) - Python library for Representation Learning on Knowledge.. Apache-2

torch-cluster (🥉 22 · ⭐ 520) - PyTorch Extension Library of Optimized Graph Cluster.. MIT

Show 15 hidden projects...
- igraph (🥇 31 · ⭐ 970) - Python interface for igraph. ❗️GPL-2.0
- pygal (🥈 28 · ⭐ 2.5K) - PYthon svg GrAph plotting Library. ❗️LGPL-3.0
- Karate Club (🥈 23 · ⭐ 1.6K) - Karate Club: An API Oriented Open-source Python Framework for.. ❗️GPL-3.0
- DeepWalk (🥉 21 · ⭐ 2.4K · 💀 ) - DeepWalk - Deep Learning for Graphs. ❗️GPL-3.0
- DIG (🥉 21 · ⭐ 1.1K) - A library for graph deep learning research. ❗️GPL-3.0
- graph-nets (🥉 20 · ⭐ 5.1K · 💀 ) - Build Graph Nets in Tensorflow. Apache-2
- pyRDF2Vec (🥉 19 · ⭐ 150) - Python Implementation and Extension of RDF2Vec. MIT
- Sematch (🥉 17 · ⭐ 390 · 💀 ) - semantic similarity framework for knowledge graph. Apache-2
- DeepGraph (🥉 17 · ⭐ 250 · 💤 ) - Analyze Data with Pandas-based Networks. Documentation:. BSD-3
- GraphEmbedding (🥉 16 · ⭐ 2.8K · 💀 ) - Implementation and experiments of graph embedding.. MIT
- OpenKE (🥉 15 · ⭐ 3K · 💀 ) - An Open-Source Package for Knowledge Embedding (KE). MIT
- Euler (🥉 15 · ⭐ 2.8K · 💀 ) - A distributed graph deep learning framework. Apache-2
- GraphSAGE (🥉 15 · ⭐ 2.7K · 💀 ) - Representation learning on large graphs using stochastic.. MIT
- OpenNE (🥉 15 · ⭐ 1.6K · 💀 ) - An Open-Source Package for Network Embedding (NE). MIT
- GraphVite (🥉 12 · ⭐ 1K · 💀 ) - GraphVite: A General and High-performance Graph Embedding System. Apache-2
Audio Data
Libraries for audio analysis, manipulation, transformation, and extraction, as well as speech recognition and music generation tasks.
DeepSpeech (🥇 34 · ⭐ 20K · 📈 ) - DeepSpeech is an open source embedded (offline, on-.. MPL-2.0

torchaudio (🥇 33 · ⭐ 1.7K) - Data manipulation and transformation for audio signal.. BSD-2

speechbrain (🥈 32 · ⭐ 4.1K) - A PyTorch-based Speech Toolkit. Apache-2

SpeechRecognition (🥈 31 · ⭐ 6.3K) - Speech recognition module for Python, supporting several.. BSD-3
pyAudioAnalysis (🥈 29 · ⭐ 4.8K) - Python Audio Analysis Library: Feature Extraction,.. Apache-2
tinytag (🥈 28 · ⭐ 530) - Read audio and music meta data and duration of MP3, OGG, OPUS, MP4, M4A,.. MIT
audioread (🥈 28 · ⭐ 400) - cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding.. MIT
audiomentations (🥉 25 · ⭐ 970) - A Python library for audio data augmentation. Inspired by.. MIT
python-soundfile (🥉 24 · ⭐ 450) - SoundFile is an audio library based on libsndfile, CFFI, and.. BSD-3
Show 8 hidden projects...
- aubio (🥈 28 · ⭐ 2.7K) - a library for audio and music analysis. ❗️GPL-3.0
- Essentia (🥉 27 · ⭐ 2.1K) - C++ library for audio and music analysis, description and.. ❗️AGPL-3.0
- python_speech_features (🥉 24 · ⭐ 2.1K · 💀 ) - This library provides common speech features for ASR.. MIT
- TTS (🥉 22 · ⭐ 5.9K · 💀 ) - Deep learning for Text to Speech (Discussion forum:.. MPL-2.0
- Dejavu (🥉 22 · ⭐ 5.7K · 💀 ) - Audio fingerprinting and recognition in Python. MIT
- TimeSide (🥉 22 · ⭐ 320) - Scalable audio processing framework written in Python with a.. ❗️AGPL-3.0
- Muda (🥉 18 · ⭐ 210 · 💤 ) - A library for augmenting annotated audio data. ISC
- Julius (🥉 17 · ⭐ 270) - Fast PyTorch based DSP for audio and 1D signals. MIT
Geospatial Data
Libraries to load, process, analyze, and write geographic data as well as libraries for spatial analysis, map visualization, and geocoding.
pydeck (🥇 42 · ⭐ 9.8K) - WebGL2 powered visualization framework. MIT

- GitHub (👨💻 190 · 🔀 1.8K · 📦 4.1K · 📋 2.4K - 6% open · ⏱️ 19.05.2022): git clone https://github.com/visgl/deck.gl
- PyPi (📥 980K / month · 📦 23 · ⏱️ 25.10.2021): pip install pydeck
- Conda (📥 100K · ⏱️ 26.10.2021): conda install -c conda-forge pydeck
- npm (📥 300K / month · 📦 380 · ⏱️ 18.05.2022): npm install deck.gl
ipyleaflet (🥈 33 · ⭐ 1.3K) - A Jupyter - Leaflet.js bridge. MIT

- GitHub (👨💻 78 · 🔀 330 · 📦 1.7K · 📋 490 - 38% open · ⏱️ 17.05.2022): git clone https://github.com/jupyter-widgets/ipyleaflet
- PyPi (📥 65K / month · 📦 110 · ⏱️ 14.04.2022): pip install ipyleaflet
- Conda (📥 820K · ⏱️ 16.05.2022): conda install -c conda-forge ipyleaflet
- npm (📥 51K / month · 📦 2 · ⏱️ 14.04.2022): npm install jupyter-leaflet
ArcGIS API (🥉 31 · ⭐ 1.3K) - Documentation and samples for ArcGIS API for Python. Apache-2
- GitHub (👨💻 80 · 🔀 880 · 📥 3.3K · 📋 510 - 24% open · ⏱️ 17.05.2022): git clone https://github.com/Esri/arcgis-python-api
- PyPi (📥 70K / month · 📦 22 · ⏱️ 03.02.2022): pip install arcgis
- Docker Hub (📥 7K · ⭐ 33 · ⏱️ 04.02.2022): docker pull esridocker/arcgis-api-python-notebook
EarthPy (🥉 26 · ⭐ 360) - A package built to support working with spatial data using open source.. BSD-3
Show 8 hidden projects...
- Geocoder (🥈 32 · ⭐ 1.4K · 💀 ) - Python Geocoder. MIT
- Satpy (🥉 30 · ⭐ 840) - Python package for earth-observing satellite data processing. ❗️GPL-3.0
- Sentinelsat (🥉 27 · ⭐ 750) - Search and download Copernicus Sentinel satellite images. ❗️GPL-3.0
- gmaps (🥉 24 · ⭐ 740 · 💀 ) - Google maps for Jupyter notebooks. BSD-3
- Mapbox GL (🥉 23 · ⭐ 610 · 💀 ) - Use Mapbox GL JS to visualize data in a Python Jupyter notebook. MIT
- pymap3d (🥉 22 · ⭐ 250) - pure-Python (Numpy optional) 3D coordinate conversions for geospace ecef.. BSD-2
- geoplotlib (🥉 21 · ⭐ 960 · 💀 ) - python toolbox for visualizing geographical data and making maps. MIT
- prettymaps (🥉 17 · ⭐ 8K) - A small set of Python functions to draw pretty maps from.. ❗️AGPL-3.0
Financial Data
Libraries for algorithmic stock/crypto trading, risk analytics, backtesting, technical analysis, and other tasks on financial data.