This curated list contains 840 awesome open-source projects with a total of 2.9M stars grouped into 32 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!
Contents
- Machine Learning Frameworks 54 projects
- Data Visualization 49 projects
- Text Data & NLP 82 projects
- Image Data 49 projects
- Graph Data 29 projects
- Audio Data 24 projects
- Geospatial Data 22 projects
- Financial Data 23 projects
- Time Series Data 20 projects
- Medical Data 19 projects
- Optical Character Recognition 11 projects
- Data Containers & Structures 28 projects
- Data Loading & Extraction 23 projects
- Web Scraping & Crawling 1 projects
- Data Pipelines & Streaming 36 projects
- Distributed Machine Learning 26 projects
- Hyperparameter Optimization & AutoML 45 projects
- Reinforcement Learning 19 projects
- Recommender Systems 14 projects
- Privacy Machine Learning 6 projects
- Workflow & Experiment Tracking 35 projects
- Model Serialization & Conversion 11 projects
- Model Interpretability 46 projects
- Vector Similarity Search (ANN) 12 projects
- Probabilistics & Statistics 21 projects
- Adversarial Robustness 8 projects
- GPU Utilities 18 projects
- Tensorflow Utilities 13 projects
- Sklearn Utilities 17 projects
- Pytorch Utilities 27 projects
- Database Clients 1 projects
- Others 53 projects
Explanation
🥇 🥈 🥉 Combined project-quality score⭐️ Star count from GitHub🐣 New project (less than 6 months old)💤 Inactive project (6 months no activity)💀 Dead project (12 months no activity)📈 📉 Project is trending up or down➕ Project was recently added❗️ Warning (e.g. missing/risky license)👨💻 Contributors count from GitHub🔀 Fork count from GitHub📋 Issue count from GitHub⏱️ Last update timestamp on package manager📥 Download count from package manager📦 Number of dependent projectsTensorflow related project
Sklearn related project
PyTorch related project
MxNet related project
Apache Spark related project
Jupyter related project
PaddlePaddle related project
Pandas related project
Machine Learning Frameworks
General-purpose machine learning and deep learning frameworks.
Tensorflow (🥇 44 · ⭐ 160K) - An Open Source Machine Learning Framework for Everyone. Apache-2

-
GitHub (
👨💻 3.7K ·🔀 85K ·📦 140K ·📋 32K - 12% open ·⏱️ 03.06.2021):git clone https://github.com/tensorflow/tensorflow
-
PyPi (
📥 10M / month ·📦 23K ·⏱️ 13.05.2021):pip install tensorflow
-
Conda (
📥 2.6M ·⏱️ 30.04.2021):conda install -c conda-forge tensorflow
-
Docker Hub (
📥 53M ·⭐ 1.9K ·⏱️ 03.06.2021):docker pull tensorflow/tensorflow
scikit-learn (🥇 37 · ⭐ 46K) - scikit-learn: machine learning in Python. BSD-3

XGBoost (🥇 37 · ⭐ 21K) - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or.. Apache-2
LightGBM (🥇 36 · ⭐ 13K) - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT,.. MIT
pytorch-lightning (🥈 34 · ⭐ 14K) - The lightweight PyTorch wrapper for high-performance.. Apache-2

Theano (🥈 34 · ⭐ 9.4K) - Theano was a Python library that allows you to define, optimize, and.. BSD-3
StatsModels (🥈 33 · ⭐ 6.4K) - Statsmodels: statistical modeling and econometrics in Python. BSD-3
Thinc (🥈 32 · ⭐ 2.3K) - A refreshing functional take on deep learning, compatible with your favorite.. MIT
PaddlePaddle (🥈 31 · ⭐ 16K) - PArallel Distributed Deep LEarning: Machine Learning.. Apache-2

Catboost (🥈 31 · ⭐ 5.9K) - A fast, scalable, high performance Gradient Boosting on Decision.. Apache-2
Vowpal Wabbit (🥈 30 · ⭐ 7.6K) - Vowpal Wabbit is a machine learning system which pushes the.. BSD-3
tensorpack (🥈 29 · ⭐ 6K) - A Neural Net Training Interface on TensorFlow, with focus on.. Apache-2

Turi Create (🥉 26 · ⭐ 10K) - Turi Create simplifies the development of custom machine learning.. BSD-3
Jina (🥉 26 · ⭐ 3.6K) - An easier way to build neural search on the cloud. Apache-2
-
GitHub (
👨💻 99 ·🔀 460 ·📦 64 ·📋 800 - 7% open ·⏱️ 03.06.2021):git clone https://github.com/jina-ai/jina
-
PyPi (
📥 8K / month ·⏱️ 03.06.2021):pip install jina
-
Docker Hub (
📥 510K ·⭐ 3 ·⏱️ 03.06.2021):docker pull jinaai/jina
Neural Network Libraries (🥉 25 · ⭐ 2.5K) - Neural Network Libraries. Apache-2
tensorflow-upstream (🥉 25 · ⭐ 560) - TensorFlow ROCm port. Apache-2

SHOGUN (🥉 22 · ⭐ 2.8K) - Unified and efficient Machine Learning. BSD-3
-
GitHub (
👨💻 250 ·🔀 1K ·📋 1.5K - 30% open ·⏱️ 08.12.2020):git clone https://github.com/shogun-toolbox/shogun
-
Conda (
📥 100K ·⏱️ 25.06.2018):conda install -c conda-forge shogun
-
Docker Hub (
📥 1.5K ·⭐ 1 ·⏱️ 31.01.2019):docker pull shogun/shogun
Neural Tangents (🥉 22 · ⭐ 1.4K) - Fast and Easy Infinite Neural Networks in Python. Apache-2
mace (🥉 21 · ⭐ 4.4K) - MACE is a deep learning inference framework optimized for mobile.. Apache-2
-
GitHub (
👨💻 61 ·🔀 770 ·📥 1.4K ·📋 640 - 6% open ·⏱️ 03.06.2021):git clone https://github.com/XiaoMi/mace
Haiku (🥉 21 · ⭐ 1.1K) - JAX-based neural network library. Apache-2
-
GitHub (
👨💻 41 ·🔀 78 ·📦 98 ·📋 88 - 22% open ·⏱️ 03.06.2021):git clone https://github.com/deepmind/dm-haiku
ThunderSVM (🥉 19 · ⭐ 1.3K) - ThunderSVM: A Fast SVM Library on GPUs and CPUs. Apache-2
Torchbearer (🥉 19 · ⭐ 600) - torchbearer: A model fitting library for PyTorch. MIT

ThunderGBM (🥉 16 · ⭐ 600) - ThunderGBM: Fast GBDTs and Random Forests on GPUs. Apache-2
NeoML (🥉 14 · ⭐ 590) - Machine learning framework for both deep learning and traditional.. Apache-2
-
GitHub (
👨💻 21 ·🔀 86 ·📋 49 - 73% open ·⏱️ 01.06.2021):git clone https://github.com/neoml-lib/neoml
Show 9 hidden projects...
- dlib (
🥈 32 ·⭐ 10K) - A toolkit for making real world machine learning and data analysis..❗️BSL-1.0
- CNTK (
🥉 26 ·⭐ 17K ·💀 ) - Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit.MIT
- NuPIC (
🥉 24 ·⭐ 6.2K ·💀 ) - Numenta Platform for Intelligent Computing is an implementation..❗️AGPL-3.0
- Lasagne (
🥉 24 ·⭐ 3.8K ·💀 ) - Lightweight library to build and train neural networks in Theano.MIT
- xLearn (
🥉 24 ·⭐ 2.9K ·💀 ) - High performance, easy-to-use, and scalable machine learning (ML)..Apache-2
- neon (
🥉 23 ·⭐ 3.9K ·💀 ) - Intel Nervana reference deep learning framework committed to best..Apache-2
- NeuPy (
🥉 22 ·⭐ 680 ·💀 ) - NeuPy is a Tensorflow based python library for prototyping and building..MIT
- MindsDB (
🥉 20 ·⭐ 3.7K) - Predictive AI layer for existing databases.❗️GPL-3.0
- StarSpace (
🥉 14 ·⭐ 3.6K ·💀 ) - Learning embeddings for classification, retrieval and ranking.MIT
Data Visualization
General-purpose and task-specific data visualization libraries.
Matplotlib (🥇 41 · ⭐ 14K) - matplotlib: plotting with Python. Python-2.0
Plotly (🥇 36 · ⭐ 9.6K) - The interactive graphing library for Python (includes Plotly Express). MIT
-
GitHub (
👨💻 180 ·🔀 1.8K ·📦 5 ·📋 2K - 44% open ·⏱️ 01.06.2021):git clone https://github.com/plotly/plotly.py
-
PyPi (
📥 5.4M / month ·📦 5K ·⏱️ 31.05.2021):pip install plotly
-
Conda (
📥 1.4M ·⏱️ 12.01.2021):conda install -c conda-forge plotly
-
NPM (
📥 36K / month ·📦 4 ·⏱️ 12.01.2021):npm install plotlywidget
dash (🥇 35 · ⭐ 15K) - Analytical Web Apps for Python, R, Julia, and Jupyter. No JavaScript Required. MIT
pandas-profiling (🥈 31 · ⭐ 7.4K) - Create HTML profiling reports from pandas DataFrame.. MIT


HoloViews (🥈 29 · ⭐ 1.9K) - With Holoviews, your data visualizes itself. BSD-3

-
GitHub (
👨💻 110 ·🔀 320 ·📋 2.6K - 27% open ·⏱️ 28.05.2021):git clone https://github.com/holoviz/holoviews
-
PyPi (
📥 110K / month ·📦 170 ·⏱️ 22.05.2021):pip install holoviews
-
Conda (
📥 500K ·⏱️ 23.05.2021):conda install -c conda-forge holoviews
-
NPM (
📥 5.3K / month ·⏱️ 24.05.2020):npm install @pyviz/jupyterlab_pyviz
bqplot (🥈 28 · ⭐ 3.1K) - Plotting library for IPython/Jupyter notebooks. Apache-2

-
GitHub (
👨💻 51 ·🔀 400 ·📦 26 ·📋 530 - 36% open ·⏱️ 03.06.2021):git clone https://github.com/bqplot/bqplot
-
PyPi (
📥 48K / month ·📦 110 ·⏱️ 25.05.2021):pip install bqplot
-
Conda (
📥 630K ·⏱️ 26.05.2021):conda install -c conda-forge bqplot
-
NPM (
📥 15K / month ·📦 10 ·⏱️ 25.05.2021):npm install bqplot
datashader (🥈 28 · ⭐ 2.5K) - Quickly and accurately render even the largest data. BSD-3
Perspective (🥈 27 · ⭐ 3.4K) - Streaming pivot visualization via WebAssembly. Apache-2

data-validation (🥈 27 · ⭐ 550) - Library for exploring and validating machine learning.. Apache-2


Facets Overview (🥉 26 · ⭐ 6.6K) - Visualizations for machine learning datasets. Apache-2

pythreejs (🥉 25 · ⭐ 730) - A Jupyter - Three.js bridge. BSD-3

-
GitHub (
👨💻 27 ·🔀 160 ·📦 17 ·📋 200 - 30% open ·⏱️ 26.02.2021):git clone https://github.com/jupyter-widgets/pythreejs
-
PyPi (
📥 29K / month ·📦 26 ·⏱️ 26.02.2021):pip install pythreejs
-
Conda (
📥 300K ·⏱️ 02.03.2021):conda install -c conda-forge pythreejs
-
NPM (
📥 7.9K / month ·📦 8 ·⏱️ 26.02.2021):npm install jupyter-threejs
hvPlot (🥉 25 · ⭐ 400) - A high-level plotting API for pandas, dask, xarray, and networkx built on.. BSD-3
D-Tale (🥉 24 · ⭐ 2.4K) - Visualizer for pandas data structures. ❗️LGPL-2.1


HyperTools (🥉 24 · ⭐ 1.6K) - A Python toolbox for gaining geometric insights into high-dimensional.. MIT
Chartify (🥉 23 · ⭐ 2.9K) - Python library that makes it easy for data scientists to create.. Apache-2
Multicore-TSNE (🥉 23 · ⭐ 1.6K · 💤 ) - Parallel t-SNE implementation with Python and Torch.. BSD-3

Pandas-Bokeh (🥉 23 · ⭐ 690) - Bokeh Plotting Backend for Pandas and GeoPandas. MIT

python-ternary (🥉 23 · ⭐ 420) - Ternary plotting library for python with matplotlib. MIT
Sweetviz (🥉 21 · ⭐ 1.6K) - Visualize and compare datasets, target values and associations, with one.. MIT
AutoViz (🥉 19 · ⭐ 370) - Automatically Visualize any dataset, any size with a single line of.. Apache-2
animatplot (🥉 19 · ⭐ 370 · 💤 ) - A python package for animating plots build on matplotlib. MIT
data-describe (🥉 15 · ⭐ 280) - datadescribe: Pythonic EDA Accelerator for Data Science. Apache-2
Show 4 hidden projects...
- plotnine (
🥈 28 ·⭐ 2.7K) - A grammar of graphics for Python.❗️GPL-2.0
- pivottablejs (
🥉 21 ·⭐ 430 ·💀 ) - Dragndrop Pivot Tables and Charts for Jupyter/IPython..MIT
- pdvega (
🥉 16 ·⭐ 340 ·💀 ) - Interactive plotting for Pandas using Vega-Lite.MIT
- nptsne (
🥉 12 ·⭐ 25) - nptsne is a numpy compatible python binary package that offers a number..Apache-2
Text Data & NLP
Libraries for processing, cleaning, manipulating, and analyzing text data as well as libraries for NLP tasks such as language detection, fuzzy matching, classification, seq2seq learning, conversational AI, keyword extraction, and translation.
transformers (🥇 36 · ⭐ 47K) - Transformers: State-of-the-art Natural Language.. Apache-2


gensim (🥇 36 · ⭐ 12K) - Topic Modelling for Humans. ❗️LGPL-2.1
nltk (🥇 35 · ⭐ 9.9K) - Suite of libraries and programs for symbolic and statistical natural.. Apache-2
ChatterBot (🥇 31 · ⭐ 11K) - ChatterBot is a machine learning, conversational dialog engine for.. BSD-3
flair (🥇 31 · ⭐ 10K) - A very simple framework for state-of-the-art Natural Language Processing.. MIT

sentencepiece (🥇 31 · ⭐ 5.1K) - Unsupervised text tokenizer for Neural Network-based text.. Apache-2
snowballstemmer (🥇 31 · ⭐ 500) - Snowball compiler and stemming algorithms. BSD-3
TextBlob (🥈 30 · ⭐ 7.7K) - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech.. MIT
sentence-transformers (🥈 29 · ⭐ 5.2K) - Multilingual Sentence & Image Embeddings with BERT. Apache-2

DeepPavlov (🥈 28 · ⭐ 5.2K) - An open source library for deep learning end-to-end dialog.. Apache-2

Tokenizers (🥈 28 · ⭐ 4.6K) - Fast State-of-the-Art Tokenizers optimized for Research and.. Apache-2
phonenumbers (🥈 28 · ⭐ 2.7K) - Python port of Google's libphonenumber. Apache-2
TensorFlow Text (🥈 27 · ⭐ 740) - Making text a first-class citizen in TensorFlow. Apache-2

vaderSentiment (🥈 26 · ⭐ 3K) - VADER Sentiment Analysis. VADER (Valence Aware Dictionary and.. MIT
haystack (🥈 26 · ⭐ 1.9K) - End-to-end Python framework for building natural language search.. Apache-2
inflect (🥈 26 · ⭐ 520) - Correctly generate plurals, ordinals, indefinite articles; convert numbers.. MIT
textgenrnn (🥉 25 · ⭐ 4.4K · 💤 ) - Easily train your own text-generating neural network of any.. MIT

Sumy (🥉 25 · ⭐ 2.6K · 💤 ) - Module for automatic summarization of text documents and HTML pages. Apache-2
TextDistance (🥉 25 · ⭐ 2K) - Compute distance between sequences. 30+ algorithms, pure python.. MIT
scattertext (🥉 25 · ⭐ 1.6K) - Beautiful visualizations of how language differs among document.. Apache-2
pyahocorasick (🥉 25 · ⭐ 610) - Python module (C extension and plain python) implementing Aho-.. BSD-3
Ciphey (🥉 24 · ⭐ 7K) - Automatically decrypt encryptions without knowing the key or cipher, decode.. MIT
-
GitHub (
👨💻 44 ·🔀 390 ·📋 260 - 23% open ·⏱️ 28.05.2021):git clone https://github.com/Ciphey/Ciphey
-
PyPi (
📥 7.7K / month ·⏱️ 23.05.2021):pip install ciphey
-
Docker Hub (
📥 9.8K ·⭐ 3 ·⏱️ 24.05.2021):docker pull remnux/ciphey
pytorch-nlp (🥉 24 · ⭐ 1.9K) - Basic Utilities for PyTorch Natural Language Processing (NLP). BSD-3

spacy-transformers (🥉 24 · ⭐ 960) - Use pretrained transformers like BERT, XLNet and GPT-2.. MIT
spacy
DeepMatcher (🥉 23 · ⭐ 3.6K) - Python package for performing Entity and Text Matching using Deep.. BSD-3
neuralcoref (🥉 23 · ⭐ 2.3K) - Fast Coreference Resolution in spaCy with Neural Networks. MIT
fastNLP (🥉 23 · ⭐ 2.2K) - fastNLP: A Modularized and Extensible NLP Framework. Currently still.. Apache-2
PyTextRank (🥉 23 · ⭐ 1.5K) - Python implementation of TextRank for phrase extraction and.. MIT
pySBD (🥉 23 · ⭐ 320) - pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence.. MIT
gpt-2-simple (🥉 22 · ⭐ 2.7K) - Python package to easily retrain OpenAI's GPT-2 text-.. MIT

Texar (🥉 22 · ⭐ 2.2K · 💤 ) - Toolkit for Machine Learning, Natural Language Processing, and.. Apache-2

SciSpacy (🥉 22 · ⭐ 930) - A full spaCy pipeline and models for scientific/biomedical documents. Apache-2
NLP Architect (🥉 20 · ⭐ 2.7K) - A model library for exploring state-of-the-art deep learning.. Apache-2
Texthero (🥉 20 · ⭐ 2.2K) - Text preprocessing, representation and visualization from zero to hero. MIT
DELTA (🥉 20 · ⭐ 1.4K) - DELTA is a deep learning based natural language and speech.. Apache-2

-
GitHub (
👨💻 41 ·🔀 290 ·📋 79 - 11% open ·⏱️ 17.12.2020):git clone https://github.com/Delta-ML/delta
-
PyPi (
📥 13 / month ·⏱️ 27.03.2020):pip install delta-nlp
-
Docker Hub (
📥 13K ·⏱️ 01.06.2021):docker pull zh794390558/delta
YouTokenToMe (🥉 20 · ⭐ 740) - Unsupervised text tokenizer focused on computational efficiency. MIT
OpenNRE (🥉 15 · ⭐ 3.1K) - An Open-Source Package for Neural Relation Extraction (NRE). MIT
-
GitHub (
👨💻 9 ·🔀 860 ·📋 330 - 6% open ·⏱️ 31.05.2021):git clone https://github.com/thunlp/OpenNRE
NeuralQA (🥉 15 · ⭐ 200) - NeuralQA: A Usable Library for Question Answering on Large Datasets with.. MIT
ONNX-T5 (🥉 15 · ⭐ 160) - Summarization, translation, sentiment-analysis, text-generation and.. Apache-2
Show 9 hidden projects...
- fuzzywuzzy (
🥇 31 ·⭐ 8.1K) - Fuzzy String Matching in Python.❗️GPL-2.0
- langid (
🥈 27 ·⭐ 1.8K ·💀 ) - Stand-alone language identification system.BSD-3
- polyglot (
🥈 26 ·⭐ 1.8K ·💤 ) - Multilingual text (NLP) processing toolkit.❗️GPL-3.0
- flashtext (
🥉 24 ·⭐ 4.8K ·💀 ) - Extract Keywords from sentence or Replace keywords in sentences.MIT
- stop-words (
🥉 22 ·⭐ 130 ·💀 ) - Get list of common stop words in various languages in Python.BSD-3
- NeuroNER (
🥉 19 ·⭐ 1.6K ·💀 ) - Named-entity recognition using neural networks. Easy-to-use and..MIT
- pyfasttext (
🥉 19 ·⭐ 230 ·💀 ) - Yet another Python binding for fastText.❗️GPL-3.0
- Headliner (
🥉 15 ·⭐ 230 ·💀 ) - Easy training and deployment of seq2seq models.MIT
- TransferNLP (
🥉 14 ·⭐ 290 ·💀 ) - NLP library designed for reproducible experimentation..MIT
Image Data
Libraries for image & video processing, manipulation, and augmentation as well as libraries for computer vision tasks such as facial recognition, object detection, and classification.
torchvision (🥇 36 · ⭐ 9.2K) - Datasets, Transforms and Models specific to Computer Vision. BSD-3

scikit-image (🥇 34 · ⭐ 4.4K) - Image processing in Python. BSD-2
opencv-python (🥈 31 · ⭐ 2K) - Automated CI toolchain to produce precompiled opencv-python,.. MIT
Face Recognition (🥈 30 · ⭐ 40K · 💤 ) - The world's simplest facial recognition api for.. MIT

Albumentations (🥈 30 · ⭐ 8.1K · 📈 ) - Fast image augmentation library and an easy-to-use.. MIT

detectron2 (🥈 29 · ⭐ 17K) - Detectron2 is FAIR's next-generation platform for object.. Apache-2

PyTorch Image Models (🥈 28 · ⭐ 11K) - PyTorch image models, scripts, pretrained weights --.. Apache-2

-
GitHub (
👨💻 46 ·🔀 1.5K ·📥 410K ·📦 620 ·📋 320 - 9% open ·⏱️ 31.05.2021):git clone https://github.com/rwightman/pytorch-image-models
imageai (🥈 28 · ⭐ 6.2K) - A python library built to empower developers to build applications and.. MIT
InsightFace (🥈 27 · ⭐ 9.3K) - Face Analysis Project on MXNet and PyTorch. MIT

MMDetection (🥈 26 · ⭐ 15K) - OpenMMLab Detection Toolbox and Benchmark. Apache-2

-
GitHub (
👨💻 220 ·🔀 5.3K ·📦 58 ·📋 3.8K - 7% open ·⏱️ 03.06.2021):git clone https://github.com/open-mmlab/mmdetection
facenet-pytorch (🥈 26 · ⭐ 2.1K) - Pretrained Pytorch face detection (MTCNN) and facial.. MIT

Face Alignment (🥉 24 · ⭐ 5K) - 2D and 3D Face alignment library build using pytorch. BSD-3

Image Super-Resolution (🥉 23 · ⭐ 2.8K) - Super-scale your images and run experiments with.. Apache-2

-
GitHub (
👨💻 10 ·🔀 540 ·📦 50 ·📋 180 - 40% open ·⏱️ 02.06.2021):git clone https://github.com/idealo/image-super-resolution
-
PyPi (
📥 5.4K / month ·📦 8 ·⏱️ 08.01.2020):pip install ISR
-
Docker Hub (
📥 150 ·⏱️ 01.04.2019):docker pull idealo/image-super-resolution-gpu
Torch Points 3D (🥉 23 · ⭐ 1.3K) - Pytorch framework for doing deep learning on point clouds. BSD-3

CellProfiler (🥉 23 · ⭐ 580) - An open-source application for biological image analysis. BSD-3
tensorflow-graphics (🥉 22 · ⭐ 2.5K) - TensorFlow Graphics: Differentiable Graphics Layers.. Apache-2

vidgear (🥉 22 · ⭐ 1.8K) - High-performance cross-platform Video Processing Python framework.. Apache-2
vit-pytorch (🥉 21 · ⭐ 4.5K) - Implementation of Vision Transformer, a simple way to achieve.. MIT

Image Deduplicator (🥉 21 · ⭐ 3.6K · 💤 ) - Finding duplicate images made easy!. Apache-2

Classy Vision (🥉 20 · ⭐ 1.3K) - An end-to-end PyTorch framework for image and video.. MIT

image-match (🥉 19 · ⭐ 2.6K) - Quickly search over billions of images. Apache-2
PaddleDetection (🥉 18 · ⭐ 4.1K) - Object detection and instance segmentation toolkit.. Apache-2

-
GitHub (
👨💻 60 ·🔀 990 ·📦 5 ·📋 1.8K - 21% open ·⏱️ 03.06.2021):git clone https://github.com/PaddlePaddle/PaddleDetection
Norfair (🥉 18 · ⭐ 1K) - Lightweight Python library for adding real-time 2D object tracking to any.. BSD-3
pycls (🥉 17 · ⭐ 1.6K · 📈 ) - Codebase for Image Classification Research, written in PyTorch. MIT

-
GitHub (
👨💻 11 ·🔀 180 ·📦 3 ·📋 64 - 23% open ·⏱️ 21.05.2021):git clone https://github.com/facebookresearch/pycls
PySlowFast (🥉 16 · ⭐ 3.8K) - PySlowFast: video understanding codebase from FAIR for.. Apache-2

-
GitHub (
👨💻 23 ·🔀 720 ·📦 4 ·📋 400 - 50% open ·⏱️ 03.06.2021):git clone https://github.com/facebookresearch/SlowFast
DE⫶TR (🥉 15 · ⭐ 7K · 💤 ) - End-to-End Object Detection with Transformers. Apache-2

-
GitHub (
👨💻 20 ·🔀 1.1K ·📋 330 - 25% open ·⏱️ 15.11.2020):git clone https://github.com/facebookresearch/detr
Show 6 hidden projects...
- glfw (
🥈 30 ·⭐ 7.7K) - A multi-platform library for OpenGL, OpenGL ES, Vulkan, window and input.❗️Zlib
- Augmentor (
🥉 25 ·⭐ 4.4K ·💀 ) - Image augmentation library in Python for machine learning.MIT
- chainercv (
🥉 25 ·⭐ 1.5K ·💀 ) - ChainerCV: a Library for Deep Learning in Computer Vision.MIT
- segmentation_models (
🥉 23 ·⭐ 3.2K ·💀 ) - Segmentation models with pretrained backbones. Keras..MIT
- Pillow-SIMD (
🥉 23 ·⭐ 1.6K ·💤 ) - The friendly PIL fork.❗️PIL
- Luminoth (
🥉 22 ·⭐ 2.4K ·💀 ) - Deep Learning toolkit for Computer Vision.BSD-3
Graph Data
Libraries for graph processing, clustering, embedding, and machine learning tasks.
PyTorch Geometric (🥇 29 · ⭐ 11K) - Geometric Deep Learning Extension Library for PyTorch. MIT

dgl (🥈 27 · ⭐ 7.4K) - Python package built to ease deep learning on graph, on top of existing.. Apache-2
StellarGraph (🥈 26 · ⭐ 1.9K) - StellarGraph - Machine Learning on Graphs. Apache-2

AmpliGraph (🥈 23 · ⭐ 1.5K) - Python library for Representation Learning on Knowledge.. Apache-2

ogb (🥈 23 · ⭐ 950) - Benchmark datasets, data loaders, and evaluators for graph machine learning. MIT
torch-cluster (🥈 22 · ⭐ 380) - PyTorch Extension Library of Optimized Graph Cluster.. MIT

PyTorch-BigGraph (🥉 20 · ⭐ 2.8K) - Generate embeddings from large-scale graph-structured.. BSD-3

graph-nets (🥉 19 · ⭐ 4.9K) - Build Graph Nets in Tensorflow. Apache-2

Paddle Graph Learning (🥉 19 · ⭐ 1K) - Paddle Graph Learning (PGL) is an efficient and.. Apache-2

pytorch_geometric_temporal (🥉 18 · ⭐ 780) - A Temporal Extension Library for PyTorch Geometric. MIT

GraphEmbedding (🥉 15 · ⭐ 2K · 💤 ) - Implementation and experiments of graph embedding.. MIT

-
GitHub (
👨💻 8 ·🔀 620 ·📦 9 ·📋 49 - 71% open ·⏱️ 18.10.2020):git clone https://github.com/shenweichen/GraphEmbedding
OpenKE (🥉 14 · ⭐ 2.6K) - An Open-Source Package for Knowledge Embedding (KE). MIT
-
GitHub (
👨💻 10 ·🔀 790 ·📋 300 - 21% open ·⏱️ 06.04.2021):git clone https://github.com/thunlp/OpenKE
GraphVite (🥉 12 · ⭐ 910) - GraphVite: A General and High-performance Graph Embedding System. Apache-2
Show 8 hidden projects...
- igraph (
🥇 28 ·⭐ 830) - Python interface for igraph.❗️GPL-2.0
- pygal (
🥈 27 ·⭐ 2.4K) - PYthon svg GrAph plotting Library.❗️LGPL-3.0
- Karate Club (
🥈 22 ·⭐ 1.3K) - Karate Club: An API Oriented Open-source Python Framework for..❗️GPL-3.0
- DeepWalk (
🥉 20 ·⭐ 2.3K ·💀 ) - DeepWalk - Deep Learning for Graphs.❗️GPL-3.0
- Sematch (
🥉 16 ·⭐ 350 ·💀 ) - semantic similarity framework for knowledge graph.Apache-2
- pyRDF2Vec (
🥉 16 ·⭐ 120) - Python Implementation and Extension of RDF2Vec.MIT
- GraphSAGE (
🥉 14 ·⭐ 2.3K ·💀 ) - Representation learning on large graphs using stochastic..MIT
- OpenNE (
🥉 13 ·⭐ 1.5K ·💀 ) - An Open-Source Package for Network Embedding (NE).MIT
Audio Data
Libraries for audio analysis, manipulation, transformation, and extraction, as well as speech recognition and music generation tasks.
torchaudio (🥇 29 · ⭐ 1.3K) - Data manipulation and transformation for audio signal.. BSD-2

pyAudioAnalysis (🥈 27 · ⭐ 3.9K) - Python Audio Analysis Library: Feature Extraction,.. Apache-2
python-soundfile (🥈 25 · ⭐ 380) - SoundFile is an audio library based on libsndfile, CFFI, and.. BSD-3
DeepSpeech (🥉 24 · ⭐ 17K · 📉 ) - DeepSpeech is an open source embedded (offline, on-.. MPL-2.0

python_speech_features (🥉 24 · ⭐ 1.9K) - This library provides common speech features for ASR.. MIT
tinytag (🥉 22 · ⭐ 460) - Read music meta data and length of MP3, OGG, OPUS, MP4, M4A, FLAC, WMA and.. MIT
TTS (🥉 20 · ⭐ 4.8K) - Deep learning for Text to Speech (Discussion forum:.. MPL-2.0
-
GitHub (
👨💻 55 ·🔀 770 ·📥 680 ·📋 500 - 7% open ·⏱️ 12.02.2021):git clone https://github.com/mozilla/TTS
Show 4 hidden projects...
- SpeechRecognition (
🥇 29 ·⭐ 5.7K ·💀 ) - Speech recognition module for Python, supporting..BSD-3
- aubio (
🥈 26 ·⭐ 2.1K) - a library for audio and music analysis.❗️GPL-3.0
- Essentia (
🥉 24 ·⭐ 1.8K) - C++ library for audio and music analysis, description and..❗️AGPL-3.0
- Madmom (
🥉 22 ·⭐ 740 ·💀 ) - Python audio and music signal processing library.BSD-3
Geospatial Data
Libraries to load, process, analyze, and write geographic data as well as libraries for spatial analysis, map visualization, and geocoding.
pydeck (🥇 34 · ⭐ 8.7K) - WebGL2 powered geospatial visualization layers. MIT

-
GitHub (
👨💻 160 ·🔀 1.6K ·📦 1.6K ·📋 2.1K - 4% open ·⏱️ 03.06.2021):git clone https://github.com/visgl/deck.gl
-
PyPi (
📥 290K / month ·📦 2 ·⏱️ 13.04.2021):pip install pydeck
-
Conda (
📥 33K ·⏱️ 13.04.2021):conda install -c conda-forge pydeck
-
NPM (
📥 180K / month ·📦 560 ·⏱️ 25.05.2021):npm install deck.gl
ipyleaflet (🥉 29 · ⭐ 1.1K) - A Jupyter - Leaflet.js bridge. MIT

-
GitHub (
👨💻 64 ·🔀 290 ·📦 860 ·📋 420 - 37% open ·⏱️ 12.05.2021):git clone https://github.com/jupyter-widgets/ipyleaflet
-
PyPi (
📥 45K / month ·📦 98 ·⏱️ 05.01.2021):pip install ipyleaflet
-
Conda (
📥 680K ·⏱️ 16.01.2021):conda install -c conda-forge ipyleaflet
-
NPM (
📥 22K / month ·📦 2 ·⏱️ 05.01.2021):npm install jupyter-leaflet
ArcGIS API (🥉 25 · ⭐ 1K) - Documentation and samples for ArcGIS API for Python. Apache-2
-
GitHub (
👨💻 62 ·🔀 740 ·📋 350 - 34% open ·⏱️ 25.05.2021):git clone https://github.com/Esri/arcgis-python-api
-
PyPi (
📥 47K / month ·📦 20 ·⏱️ 12.04.2021):pip install arcgis
-
Docker Hub (
📥 4.3K ·⭐ 32 ·⏱️ 06.03.2020):docker pull esridocker/arcgis-api-python-notebook
EarthPy (🥉 20 · ⭐ 260) - A package built to support working with spatial data using open source.. BSD-3
pymap3d (🥉 19 · ⭐ 190) - pure-Python (Numpy optional) 3D coordinate conversions for geospace ecef.. BSD-2
Show 6 hidden projects...
- Geocoder (
🥈 30 ·⭐ 1.4K ·💀 ) - Python Geocoder.MIT
- Cartopy (
🥉 29 ·⭐ 1.5K) - Rasterio reads and writes geospatial raster datasets.❗️LGPL-3.0
- Sentinelsat (
🥉 23 ·⭐ 610 ·📈 ) - Search and download Copernicus Sentinel satellite images.❗️GPL-3.0
- gmaps (
🥉 22 ·⭐ 710 ·💀 ) - Google maps for Jupyter notebooks.BSD-3
- geoplotlib (
🥉 21 ·⭐ 920 ·💀 ) - python toolbox for visualizing geographical data and making maps.MIT
- Satpy (
🥉 21 ·⭐ 700) - Python package for earth-observing satellite data processing.❗️GPL-3.0
Financial Data
Libraries for algorithmic stock/crypto trading, risk analytics, backtesting, technical analysis, and other tasks on financial data.
yfinance (🥇 29 · ⭐ 5.1K) - Yahoo! Finance market data downloader (+faster Pandas Datareader). Apache-2
Alpha Vantage (🥈 25 · ⭐ 3.4K) - A python wrapper for Alpha Vantage API for financial data. MIT
empyrical (🥈 24 · ⭐ 790 · 💤 ) - Common financial risk and performance metrics. Used by zipline.. Apache-2
TensorTrade (🥉 23 · ⭐ 3.2K) - An open source reinforcement learning framework for training,.. Apache-2
Enigma Catalyst (🥉 23 · ⭐ 2.2K) - An Algorithmic Trading Library for Crypto-Assets in Python. Apache-2
stockstats (🥉 22 · ⭐ 790 · 💤 ) - Supply a wrapper ``StockDataFrame`` based on the.. BSD-3
tf-quant-finance (🥉 20 · ⭐ 2.6K) - High-performance TensorFlow library for quantitative.. Apache-2

finmarketpy (🥉 20 · ⭐ 2.6K) - Python library for backtesting trading strategies & analyzing.. Apache-2
Crypto Signals (🥉 19 · ⭐ 3.1K · 💤 ) - Github.com/CryptoSignal - #1 Quant Trading & Technical.. MIT
-
GitHub (
👨💻 27 ·🔀 830 ·📋 240 - 18% open ·⏱️ 03.09.2020):git clone https://github.com/CryptoSignal/crypto-signal
-
Docker Hub (
📥 140K ·⭐ 7 ·⏱️ 03.09.2020):docker pull shadowreaver/crypto-signal
Show 7 hidden projects...
- backtrader (
🥈 26 ·⭐ 6.6K) - Python Backtesting library for trading strategies.❗️GPL-3.0
- Alphalens (
🥈 24 ·⭐ 1.9K ·💀 ) - Performance analysis of predictive (alpha) stock factors.Apache-2
- PyAlgoTrade (
🥉 23 ·⭐ 3.3K ·💀 ) - Python Algorithmic Trading Library.Apache-2
- FinTA (
🥉 22 ·⭐ 1.1K) - Common financial technical indicators implemented in Pandas.❗️LGPL-3.0
- arch (
🥉 22 ·⭐ 710) - ARCH models in Python.❗️NCSA
- Backtesting.py (
🥉 18 ·⭐ 1.4K) - Backtest trading strategies in Python.❗️AGPL-3.0
- surpriver (
🥉 12 ·⭐ 1.2K ·💤 ) - Find big moving stocks before they move using machine..❗️GPL-3.0
Time Series Data
Libraries for forecasting, anomaly detection, feature extraction, and machine learning on time-series and sequential data.
Prophet (🥇 29 · ⭐ 13K) - Tool for producing high quality forecasts for time series data that has.. MIT
pmdarima (🥇 27 · ⭐ 900) - A statistical library designed to fill the void in Python's time series.. MIT
STUMPY (🥉 21 · ⭐ 1.8K) - STUMPY is a powerful and scalable Python library for computing a Matrix.. BSD-3
Darts (🥉 21 · ⭐ 890) - A python library for easy manipulation and forecasting of time series. Apache-2
-
GitHub (
👨💻 29 ·🔀 120 ·📦 5 ·📋 93 - 22% open ·⏱️ 22.05.2021):git clone https://github.com/unit8co/darts
-
PyPi (
📥 1.5K / month ·⏱️ 22.05.2021):pip install u8darts
-
Docker Hub (
📥 130 ·⏱️ 22.05.2021):docker pull unit8/darts
pytorch-forecasting (🥉 20 · ⭐ 1.1K) - Time series forecasting with PyTorch. MIT
tick (🥉 18 · ⭐ 340 · 💤 ) - Module for statistical learning, with a particular emphasis on time-.. BSD-3
Show 5 hidden projects...
- PyFlux (
🥈 23 ·⭐ 1.9K ·💀 ) - Open source time series library for Python.BSD-3
- luminol (
🥉 21 ·⭐ 930 ·💀 ) - Anomaly Detection and Correlation library.Apache-2
- pydlm (
🥉 20 ·⭐ 390 ·💀 ) - A python library for Bayesian time series modeling.BSD-3
- matrixprofile-ts (
🥉 19 ·⭐ 640 ·💀 ) - A Python library for detecting patterns and anomalies..Apache-2
- ADTK (
🥉 17 ·⭐ 660 ·💀 ) - A Python toolkit for rule-based/unsupervised anomaly detection in time..MPL-2.0
Medical Data
Libraries for processing and analyzing medical data such as MRIs, EEGs, genomic data, and other medical imaging formats.