#
neon
Here are 185 public repositories matching this topic...
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
-
Updated
Aug 7, 2020 - C++
Intel® Nervana™ reference deep learning framework committed to best performance on all hardware
-
Updated
May 22, 2019 - Python
The ARM Computer Vision and Machine Learning library is a set of functions optimised for both ARM CPUs and GPUs using SIMD technologies.
android
machine-learning
arm
computer-vision
neural-network
cpp
neon
opencl
simd
armv7
aarch64
armv8
-
Updated
Aug 28, 2020 - C++
C++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.
c-plus-plus
machine-learning
arm
neural-network
neon
image-processing
avx
sse
simd
avx2
sse2
sse41
avx512
powerpc
altivec
vsx
ssse3
simd-library
haar-cascade
lbp
-
Updated
Aug 28, 2020 - C++
SIMD Vector Classes for C++
c-plus-plus
cpp
portable
neon
cpp14
parallel
parallel-computing
avx
sse
cpp11
simd
cpp17
avx2
simd-programming
vectorization
avx512
simd-instructions
simd-vector
data-parallel
-
Updated
Aug 28, 2020 - C++
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
cpp
neon
avx
sse
simd
c-plus-plus-14
vectorization
avx512
mathematical-functions
simd-instructions
simd-intrinsics
-
Updated
Aug 11, 2020 - C++
Open
AVX-512BW functions
8
nemequ
commented
Mar 10, 2020
https://software.intel.com/sites/landingpage/IntrinsicsGuide/#avx512techs=AVX512BW
- _cvtmask32_u32
- _cvtmask64_u64
- _cvtu32_mask32
- _cvtu64_mask64
- _kadd_mask32
- _kadd_mask64
- _kand_mask32
- _kand_mask64
- _kandn_mask32
- _kandn_mask64
- _knot_mask32
- _knot_mask64
- _kor_mask32
- _kor_mask64
- _kort
18
c
euler
opengl
math
postfix
neon
vector
matrix
bezier
avx
sse
simd
affine-transform-matrices
opengl-math
3d
bounding-boxes
matrix-decompositions
frustum
3d-math
marix-inverse
glm-for-c
-
Updated
Aug 9, 2020 - C
Native Go version of HighwayHash with optimized assembly implementations on Intel and ARM. Able to process over 10 GB/sec on a single core on Intel CPUs - https://en.wikipedia.org/wiki/HighwayHash
-
Updated
Jun 4, 2020 - Go
C++ SIMD Noise Library
neon
simplex
fractal
sse
simd
noise
cellular
avx2
perlin
perlin-noise
white-noise
noise-library
noise-3d
fastnoise-simd
simplex-noise
fastnoise
-
Updated
Jun 18, 2020 - C++
A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
arm
neon
sse
simd
x86
arm64
aarch64
armv7l
biilabs
armv8-a
intel-sse-intrinsics
neon-intrinsics
sse-intrinsics
sse2neon
-
Updated
Aug 28, 2020 - C
JeVois smart machine vision framework
-
Updated
Aug 13, 2020 - C
Math library using hlsl syntax with SSE/NEON support
math
cpp
shaders
neon
c-plus-plus-11
vector
matrix
modern-cpp
game-development
sse
quaternion
variants
hlsl
sse41
ser
-
Updated
Aug 21, 2020 - C++
SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification
-
Updated
Mar 4, 2020 - C++
Agenium Scale vectorization library for CPUs and GPUs
neon
cuda
avx
simd
avx2
sse2
simd-programming
aarch64
avx512
simd-instructions
sse42
rocm
sve
neon128
vectorization-library
-
Updated
Aug 28, 2020 - Python
Efficient monocular visual odometry for ground vehicles on ARM processors
-
Updated
May 15, 2020 - C++
Turbo Base64 - Fastest Base64 SIMD/Neon/Altivec
encoding
benchmark
arm
library
base64
neon
avx
sse
simd
avx2
base64-encoding
base64-decoding
encoding-library
-
Updated
Aug 17, 2020 - C
BLAKE3 hashing for JavaScript: native Node bindings (where available) and WebAssembly
-
Updated
Aug 1, 2020 - TypeScript
Improve this page
Add a description, image, and links to the neon topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the neon topic, visit your repo's landing page and select "manage topics."
https://github.com/simd-everywhere/simde has done their first release ( https://simd-everywhere.github.io/blog/announcements/release/2020/06/21/0.5.0-release.html ) and it seems like something potentially useful for merging code paths. The most likely candidate I imagine is using SIMD code to replace the non-SIMD fallback path, but maybe it'd be worth checking if it can be used to merge ARM and x8