The Wayback Machine - https://web.archive.org/web/20210724223406/https://github.com/topics/inference-optimization
Here are 14 public repositories matching this topic...
High-efficiency floating-point neural network inference operators for mobile, server, and Web
The Tensor Algebra SuperOptimizer for Deep Learning
Batch normalization fusion for PyTorch
Updated Apr 6, 2020 - Python
Optimize the layer structure of a Keras model to reduce computation time
Updated Jul 18, 2020 - Python
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
A set of tools to make your life easier with TensorRT and ONNX Runtime. This repo is designed for YOLOv3.
Updated Dec 31, 2019 - Python
Modified inference engine for quantized convolution using product quantization
Batch estimation on Lie groups
Updated Jul 20, 2021 - MATLAB
A constrained expectation-maximization algorithm for feasible graph inference.
Updated Jun 10, 2021 - Jupyter Notebook
ncnn is a high-performance neural network inference framework optimized for the mobile platform
PyTorch Mobile: Android examples of usage in applications
Updated Oct 10, 2019 - Java
PyTorch Mobile: iOS examples
Updated Oct 10, 2019 - Swift
A simple tool that applies structure-level optimizations (e.g. quantization) to a TensorFlow model
Updated Aug 13, 2018 - Python
MIVisionX Python Inference Analyzer uses pre-trained ONNX/NNEF/Caffe models to analyze inference results and summarize individual image results
Updated Nov 17, 2020 - Python