Max Dokukin - XeWe Labs

I believe in 3 things

Ask my AI Secretary

About Me

Able to take in a lot of chaos and turn it into something manageable. I find much solace in that.
Been to 108 cities, and travelled distance equal to 8 equators.
My primary professional focus is LLM applications, and personal is pretty LED lights solutions.

Profile Picture

Projects

Work Experience

2023
2024
2025

ML Engineer Intern

Nuvoton

ML Engineer Intern

Nuvoton

May 2024 - August 2024


Skills:
Python, C++, Embedded Systems, Machine Learning, Audio DSP, Signal Processing, Data Augmentation, Model Optimization, Quantization, Confusion Matrices, CI/CD, Testing, Documentation, Problem Solving, Communication
Frameworks:
TensorFlow, TensorFlow Lite, scikit-learn, Keras, NumPy, Pandas, Matplotlib, GitHub Actions, Keil uVision
Accomplishments:
  • On-device sound event classification on MCU targets; baseline 49% → ~90%+ under defined test conditions
  • Built C++ test harness and Python evaluator; computed TP/TN/FP/FN, generated confusion matrices and hit-rate plots
  • Designed YAML-driven PipelineController for deterministic preprocessing: rename → segment → trim silence → normalize → augment → balance
  • Implemented selective normalization and gain jitter to preserve loudness cues and reduce false triggers
  • Tuned features: expanded MFCC 10→30 within MCU memory and timing limits; aligned window and stride with DSP constraints
  • Trained 30+ model variants (CNN, DS-CNN, LSTM); standardized configs for reproducible experiments
  • Exported minimal-operator TFLite models; quantized to int8 with ~60% size reduction and <2% accuracy delta
  • Engineered C++ KeywordSpottingSystem class and sliding-window InferenceFilter to stabilize predictions
  • Sized tensor_arena explicitly after profiling; prevented runtime OOM and fragmentation issues
  • Automated nightly CI jobs: preprocess → train → convert → flash → run tests → push CSVs and plots
  • Added distractor negatives to reduce false positives; established standard test conditions for volume and distance
  • Authored quick-start, pipeline guide, and troubleshooting notes; documented rationale for feature and threshold choices
  • Collaborated with firmware on linker scripts, DMA/I2S use, and memory maps; added guards for underflow/Na

Embedded ML Engineer

Nuvoton

Embedded ML Engineer

Nuvoton

August 2024 - Present


Skills:
Python, C++, Embedded Systems, Audio DSP, Signal Processing, Data Engineering, Model Optimization, Quantization, Memory Optimization, DMA, I2S, I2C, UI Development, Performance Testing, CI/CD, Documentation, Communication
Frameworks:
TensorFlow, TensorFlow Lite, Keras, NumPy, Pandas, Matplotlib, GitHub Actions, Keil uVision
Accomplishments:
  • Scope: productionizing edge AI stack, dataset expansion, robust MCU deployment with readable UI
  • Data ops at scale: YouTube and Pixabay scraping, mining and labeling, standardized file schema class_label_origin_name.wav, sidecar metadata and provenance index
  • Data loader redesign: AudioProcessor and DataLoader, shuffling and weighting, labeling modes, reduced loader from 525 to 407 lines, guarded edge cases
  • Scheduling and experimentation: unattended batch runs, background noise sweeps, 3D volume analyses, chosen plateaus documented in run book
  • Model ops: model_id metadata, auto config.txt emission, confusion matrix auto trim and transpose, consistent plots for comparison
  • Embedded architecture: KeywordSpottingSystem class, main.c reduced to a minimal loop, clean C and C++ boundaries
  • UI and display: DynamicText with bounded buffers and leak fixes, 100 ms refresh, collapsed repeats, field friendly readout
  • Memory and peripherals: display buffer 16 bit to 8 bit, RAM fit from about 648 kB down to near 510 kB, mapped interfaces, resolved PDMA conflict paths
  • I/O and buses: verified I2S and I2C plans, tuned priorities, added checks for NaNs and underflow to stabilize timing
  • Multi model deployment: command word spotting and sound event classification side by side, metadata printed at boot
  • Research and robustness: evaluated unknown handling strategies, class weighting, targeted negative sampling, failure log drove new data
  • Tooling and docs v2: eighteen pages across pipeline, APIs, training and deployment playbooks, integration diagrams
  • Cross team integration: reviews for linker flags, memory maps, DMA plans, serial formats; delivered demo ready builds

Upper Math Tutor

San Jose State University

Upper Math Tutor

San Jose State University

May 2024 - May 2025


Skills:
Math, Teaching, Collaboration, Calculus, Statistical Analysis, Hypothesis Testing, Linear Regression, Communication, Mentorship, Problem Solving
Accomplishments:
  • Assisted students with calculus, statistical analysis, hypothesis tests, and linear regression

Learning Assistant

San Jose State University

Learning Assistant

San Jose State University

December 2023 - May 2024


Skills:
Teaching, Collaboration, Multivariable Calculus, Feedback, Group Facilitation, Mentorship, Course Customization, Communication, Problem Solving, Leadership
Accomplishments:
  • Facilitated small-group learning activities in multivariable calculus class for 40 students
  • Communicated weekly with faculty to convey student feedback, and tailor the course

Philosophy Tutor and Grader

San Jose State University

Philosophy Tutor and Grader

San Jose State University

February 2024 - May 2025


Skills:
Teaching, Communication, Philosophy, Critical Thinking, Reasoning, Problem Solving, Mentorship, Collaboration, Academic Writing, Grading
Accomplishments:
  • Guided students in understanding philosophical theories and reasoning
  • Helped professors with grading

Machine Learning Intern

Yandex

Machine Learning Intern

Yandex

May 2023 - August 2023


Skills:
Machine Learning, NLP, Data Analysis, Pandas, NumPy, Python, Linux, Data Cleaning, Data Visualization, Automation
Accomplishments:
  • Preprocessed large datasets (1M+ records) using Pandas and NumPy, ensuring data quality through data mining and data cleaning
  • Managed data processing on remote servers via Linux terminal, including file manipulation, batch processing, and script automation, up to 320GB daily
  • Performed data visualization with Matplotlib and Seaborn, aiding in the interpretation of neural natural language processing (NLP)
ⓘ Click on experience for details

Education

2023
2025
2027

Master of Science

Artificial Intelligence

San Jose State University

Master of Science

Artificial Intelligence

San Jose State University


Accomplishments:

Bachelors of Science

Data Science

San Jose State University

Bachelor's of Science

Data Science

San Jose State University


Accomplishments:
  • Cum Summa Laude
  • Best Data Science Project SJSU 2025 Winner
  • GitHub Repository
  • Live Project Website
  • Presidents Scholar
  • Dean’s Scholar
  • AS Leadership Scholar
  • AI & ML Club President
  • Baja Data Acquisition Team Lead

Associate of Science

Mathematics

San Jose City College

Associate of Science

Mathematics

San Jose City College


Accomplishments:
  • GPA: 3.92
ⓘ Click on a block for details

Classes

2025
2026

AI & Data Engineering

CMPE 252

Artificial Intelligence and Data Engineering

CMPE 252

Dr. Jun Liu


Topics covered:
  • Problem solving by searching
  • Intelligent agents
  • Supervised machine learning
  • Unsupervised learning
  • Neural networks and deep learning
  • Reinforcement learning
  • Knowledge representation and reasoning
  • Data engineering process
  • Model evaluation and validation
  • AI ethics and responsible AI

Machine Learning

CMPE 257

Machine Learning

CMPE 257

Dr. Bernardo Flores


Topics covered:
  • Supervised learning
  • Unsupervised learning
  • Generalization and bias variance tradeoff
  • Linear models for classification and regression
  • Nonlinear feature transformation
  • Regularization and validation
  • Kernel methods and support vector machines
  • Radial basis function networks
  • Ensemble methods
  • Neural networks

Math of Decision, DS

ISE 201

Math Foundations for Decision and Data Sciences

ISE 201

Dr. Shilpa Gupta


Topics covered:
  • Linear algebra fundamentals
  • Matrix operations
  • Eigenvalues and eigenvectors
  • Singular value decomposition
  • Probability and statistics for data science
  • Hypothesis testing and estimation
  • Regression modeling
  • Optimization and convexity
  • Gradient-based optimization
  • Classification foundations

Artificial Intelligence

CS 156

Intro to Artificial Intelligence

CS 156

Rula Khayrallah


Topics covered:
  • Search algorithms
  • AI agents
  • Knowledge representation
  • ML fundamentals
  • NLP processing
  • Constraint satisfaction problems
  • Logical inference techniques
  • AI biases
  • Game theory
  • Decision-making
  • AI in real-world

Individual Studies

CS 180

Individual Studies

CS 180

Dr. Fabio Di Troia


Topics covered:
  • Multi-agent AI
  • Specialized agents
  • LLM frameworks
  • Data retrieval
  • SQL processing
  • System design
  • Interactive deployment
  • Code modularity
  • Performance evaluation
  • Documentation writing
  • Presentation

Data Science Senior Prj.

CS 163

Data Science Senior Project

CS 163

Dr. Genya Ishigaki


Topics covered:
  • Data preprocessing
  • Exploratory data analysis
  • Statistical modeling and inference
  • Regression and classification techniques
  • ML model development
  • Data visualization and storytelling
  • Feature engineering
  • Big data frameworks
  • Communicating technical findings
  • Collaboration in data-driven projects
ⓘ Click on class for details

Let's Stay in Touch