Research & Software Development Engineer

Artem Shamsuarov

15+ years building production-grade computer vision systems. Specialized in real-time 3D reconstruction, multi-camera pipelines, SLAM, and GPU-accelerated processing. Published researcher with 7+ patents.

15+ Years C++ / CUDA
7+ Patents Granted
3 Countries Worked
2 Master's Degrees

Work Experience

Artec Europe S.a.r.l.
Research & Software Development Engineer
08/2020 — Present · Luxembourg
  • Developed production-grade C++ algorithms for industrial hardware products
  • Implemented real-time image processing pipelines for multi-sensor systems
  • Achieved 10–100x performance improvements through GPU (CUDA) optimization
  • Designed automated quality control and inspection systems
  • Maintained CI/CD pipelines (Jenkins) and mentored junior engineers
Artec Group
Research & Software Development Engineer
09/2015 — 08/2020 · Moscow, Russia
  • Built real-time algorithms for embedded hardware with onboard processing
  • Implemented geometry processing and spatial computing algorithms for production systems
  • Achieved 70% latency reduction through CUDA/OpenCL/Vulkan optimization
  • Designed software architecture for distributed computing clusters
  • Built multi-platform apps (iOS/Swift/ObjC, Windows/C++) for industrial products
Samsung R&D Institute Russia
Expert Engineer — Algorithm Laboratory
06/2012 — 09/2015 · Moscow, Russia
  • Developed CV algorithms and apps deployed to millions of mobile users
  • Built complete 3D reconstruction pipeline: SfM, SLAM, depth estimation, depth fusion
  • Implemented near-duplicate image retrieval and face detection systems
  • Published at IEEE CVPR Workshop; secured 3 patents
Samsung Electronics
R&D Engineer — Semiconductor Process Development
09/2009 — 06/2012 · Suwon, South Korea
  • Developed optimization algorithms for computational lithography at next-gen semiconductor nodes
  • Created C++/Qt/Python tools for Optical Proximity Correction and Resolution Enhancement
  • Collaborated with Cadence, Mentor Graphics, Synopsys; published at SPIE; obtained 2 US patents

Technical Skills

Core Programming
C++ (Expert, 15+ yrs) · CUDA (Expert) · Python · C# · MATLAB · OOP & Design Patterns · STL / Boost · Multithreading · Memory Management
👁 Computer Vision
Real-time Image Processing · Multi-camera Systems · 3D Reconstruction · SLAM · Structure-from-Motion · Feature Detection & Matching · Object Detection · Stereo Vision · Bundle Adjustment · Camera Calibration · Optical Flow · Depth Estimation
🧊 3D Processing
Point Cloud Processing · Mesh Generation · Surface Reconstruction · Registration · Real-time Fusion · Spatial Data Structures · Next-best-view Planning
🔧 GPU & Parallel Computing
CUDA Optimization · OpenCL · Vulkan Compute · Metal · CUB / Thrust · OpenMP · TBB · MPI
📐 Math & Algorithms
Optimization Algorithms · Computational Geometry · Linear Algebra · Numerical Methods · Statistics · Mathematical Modeling
🛠 Tools & Platforms
OpenCV · PCL · Open3D · CGAL · Colmap · ROS · Eigen · Ceres · Git · CMake · Docker · Jenkins · Linux / Windows · Nvidia Jetson · Qt

Personal Explorations

Independent explorations built from scratch on personal time and hardware, unrelated to any employer's products or proprietary technology. They span ML inference, developer tools, quantitative finance, audio, and mobile apps.

Machine Learning & AI Tools
CuInfer

From-scratch LLM inference engine in C++17/CUDA. Custom tiled matmul, fused attention, RoPE, and SwiGLU kernels. GGUF model loading with Q4/Q5/Q8 quantized inference. Runs Llama 3.2 1B on a 6 GB GPU.

1.6x vs PyTorch eager · 5 quant formats · 221 tests
C++17 · CUDA · GGUF · Quantization · Transformers
Engram

GPU-accelerated local semantic code index for Claude Code. CUDA-powered ONNX embeddings, HNSW vector search, tree-sitter AST chunking across 9 languages, and an MCP server for sub-3 ms context retrieval — fully on-device.

Sub-3 ms queries · 9-language AST parsing · 177 tests
C++17 · CUDA · ONNX Runtime · tree-sitter · MCP
Quantitative Finance & GPU Computing
hft-orderbook-engine

Low-latency limit order book and matching engine with lock-free SPSC ring buffers, zero-allocation hot path, slab allocator, and market microstructure analytics (spread, microprice, order flow imbalance, Kyle's Lambda).

~124 ns median add · 6M msgs/s throughput · 322 unit tests
C++17 · Lock-free SPSC · Zero-alloc · Microstructure
cuda-portfolio-optimizer

GPU-accelerated portfolio optimization with Monte Carlo scenario generation (cuRAND + Cholesky), Mean-CVaR via custom ADMM solver, PCA factor model (15.6x speedup), and rolling-window backtesting with transaction costs.

12.5x GPU speedup · PCA factor model · 4-strategy backtest
C++17 · CUDA · cuRAND · ADMM · Mean-CVaR
Audio & Music
PickHero

Desktop guitar practice app — load any Guitar Pro tab (GP3–GP8), plug in via USB, and get real-time pitch detection with hit/miss feedback and scrolling tab playback. A lightweight, free Yousician alternative.

Real-time YIN pitch detection · GP3–GP8 tab support · MIDI backing tracks
Python · aubio · PyGuitarPro · PyGame
VocalForge

Desktop app for recording vocals over music tracks. Automatic vocal separation via Demucs, cross-correlation alignment of recording to backing track, LUFS normalization (ITU-R BS.1770-4), and one-click export of the final mix.

Demucs vocal separation · Auto-alignment (xcorr) · LUFS normalization
Python · PySide6 · Demucs · PyTorch
Mobile & Desktop Apps
DocShot

Zero-tap document scanner for Android. Auto-detects document boundaries, captures on stability, corrects perspective with sub-pixel corner refinement. Classical image processing pipeline with multi-strategy preprocessing — no ML, no cloud.

~15 ms detection/frame · ~120 ms full pipeline · 74 tests
Kotlin · Jetpack Compose · OpenCV · CameraX
TheaterEffectsViewerApp

Windows UWP app for triggering audio and visual effects during live theater performances. Two-window setup — operator control panel with color-coded effect list + theater display output to a projector. Dual media players with smooth 3-second crossfade transitions, YAML-based show definitions. Built for and used in actual live productions.

Dual-player crossfade · YAML show definitions · Used in live shows
C# · UWP · XAML · Win2D · YamlDotNet

Academic Background

M.Sc. Electrical Engineering & Computer Science
Seoul National University
Control and Dynamics Systems Lab
2007 — 2009 · Seoul, South Korea · GPA: 96.8 / 100
M.Sc. Mathematics & Mechanics
Saint-Petersburg State University
Mathematical Analysis and Theory of Functions
2002 — 2007 · Saint-Petersburg, Russia · GPA: 97.5 / 100

Let's Connect

Passionate about computer vision, real-time systems, and GPU-accelerated computing. Based in Luxembourg.