Research & Software Development Engineer

Artem Shamsuarov

15+ years building production-grade computer vision systems. Specialized in real-time 3D reconstruction, multi-camera pipelines, SLAM, and GPU-accelerated processing. Published researcher with 7+ patents.

15+ years C++ / CUDA

Patents Granted

Countries Worked

Master's Degrees

Career

Work Experience

Artec Europe S.a.r.l.

Research & Software Development Engineer

08/2020 — Present · Luxembourg

Developed production-ready CV algorithms for multi-camera 3D scanning systems
Implemented real-time image processing pipelines handling large-scale visual datasets from structured light, laser, lidar, and RGB scanners
Achieved 10–100× performance improvements through CUDA optimization
Designed real-time tracking and fusion algorithms, automated defect detection, and quality control systems
Maintained CI/CD pipelines (Jenkins) and mentored junior engineers

Artec Group

Research & Software Development Engineer

09/2015 — 08/2020 · Moscow, Russia

Built real-time fusion algorithms for a mobile 3D scanner with onboard processing
Implemented 3D reconstruction and SLAM algorithms for production scanning systems
Achieved 70% latency reduction through CUDA/OpenCL/Vulkan optimization
Designed next-best-view planning system and software architecture for distributed computing clusters
Built multi-platform apps (iOS/Swift/ObjC, Windows/C++) for industrial vision

Samsung R&D Institute Russia

Expert Engineer — Algorithm Laboratory

06/2012 — 09/2015 · Moscow, Russia

Developed CV algorithms and apps deployed to millions of mobile users
Built complete 3D reconstruction pipeline: SfM, SLAM, depth estimation, depth fusion
Implemented near-duplicate image retrieval and face detection systems
Published at IEEE CVPR Workshop; secured 3 patents

Samsung Electronics

R&D Engineer — Semiconductor Process Development

09/2009 — 06/2012 · Suwon, South Korea

Developed optimization algorithms for computational lithography at next-gen semiconductor nodes
Created C++/Qt/Python tools for Optical Proximity Correction and Resolution Enhancement
Collaborated with Cadence, Mentor Graphics, Synopsys; published at SPIE; obtained 2 US patents

Expertise

Technical Skills

⚡ Core Programming

C++ (Expert, 15+ yrs) · CUDA (Expert) · Python · C# · MATLAB · OOP & Design Patterns · STL / Boost · Multithreading · Memory Management

👁 Computer Vision

Real-time Image Processing · Multi-camera Systems · 3D Reconstruction · SLAM · Structure-from-Motion · Feature Detection & Matching · Object Detection · Stereo Vision · Bundle Adjustment · Camera Calibration · Optical Flow · Depth Estimation

🧊 3D Processing

Point Cloud Processing · Mesh Generation · Surface Reconstruction · Registration · Real-time Fusion · Spatial Data Structures · Next-best-view Planning

🔧 GPU & Parallel Computing

CUDA Optimization · OpenCL · Vulkan Compute · Metal · CUB / Thrust · OpenMP · TBB · MPI

📐 Math & Algorithms

Optimization Algorithms · Computational Geometry · Linear Algebra · Numerical Methods · Statistics · Mathematical Modeling

🛠 Tools & Platforms

OpenCV · PCL · Open3D · CGAL · Colmap · ROS · Eigen · Ceres · Git · CMake · Docker · Jenkins · Linux / Windows · Nvidia Jetson · Qt

Pet Projects

Personal Explorations

Side projects spanning computer vision, quantitative finance, and creative tools — built from scratch to explore what interests me.

Computer Vision & 3D

cuda-gaussian-splatting

From-scratch implementation of 3D Gaussian Splatting in C++23/CUDA. Custom differentiable tile-based rasterizer with analytic gradients, fused Adam optimizer as a single CUDA kernel, MCMC densification, and a real-time OpenGL viewer.

Differentiable rasterizer · Fused CUDA optimizer · Real-time viewer

C++23 · CUDA · OpenGL · libtorch · Eigen

OnyxVO

Real-time visual odometry on Android. XFeat learned features via ONNX Runtime, Vulkan compute descriptor matching through Kompute, ARM NEON SIMD preprocessing, and 5-point RANSAC pose estimation with keyframe management.

~12.5 FPS on Galaxy S21 · 0.3 ms NEON preprocess · Vulkan compute matching

C++ · Vulkan · ONNX Runtime · ARM NEON · Kotlin

DocShot

Zero-tap document scanner for Android. Auto-detects document boundaries, captures on stability, corrects perspective with sub-pixel corner refinement. Classical CV pipeline with multi-strategy preprocessing — no ML, no cloud.

~15 ms detection/frame · ~120 ms full pipeline · 74 tests

Kotlin · Jetpack Compose · OpenCV · CameraX

Quantitative Finance

hft-orderbook-engine

Low-latency limit order book and matching engine with lock-free SPSC ring buffers, zero-allocation hot path, slab allocator, and market microstructure analytics (spread, microprice, order flow imbalance, Kyle's Lambda).

~124 ns median add · 6M msgs/s throughput · 322 unit tests

C++17 · Lock-free SPSC · Zero-alloc · Microstructure

cuda-portfolio-optimizer

GPU-accelerated portfolio optimization with Monte Carlo scenario generation (cuRAND + Cholesky), Mean-CVaR via custom ADMM solver, PCA factor model (15.6x speedup), and rolling-window backtesting with transaction costs.

12.5x GPU speedup · PCA factor model · 4-strategy backtest

C++17 · CUDA · cuRAND · ADMM · Mean-CVaR

Audio & Music

PickHero

Desktop guitar practice app — load any Guitar Pro tab (GP3–GP8), plug in via USB, and get real-time pitch detection with hit/miss feedback and scrolling tab playback. A lightweight, free Yousician alternative.

Real-time YIN pitch detection · GP3–GP8 tab support · MIDI backing tracks

Python · aubio · PyGuitarPro · PyGame

VocalForge

Desktop app for recording vocals over music tracks. Automatic vocal separation via Demucs, cross-correlation alignment of recording to backing track, LUFS normalization (ITU-R BS.1770-4), and one-click export of the final mix.

Demucs vocal separation · Auto-alignment (xcorr) · LUFS normalization

Python · PySide6 · Demucs · PyTorch

Research