π Backend Engineer | MLOps & AI Infrastructure | Python & C++
I am a driven software engineer with a strong focus on building scalable backend systems, robust AI/ML infrastructure, and secure microservices. My expertise spans Clean Architecture, Natural Language Processing (NLP), and deploying Local LLMs (RAG pipelines) into production. I am passionate about taking complex concepts from zero to production and solving real-world business challenges without compromising data privacy.
- AI Infrastructure: Building Zero-Leak Enterprise RAG systems, integrating Local LLMs (Ollama, Llama 3), and working with Vector Databases (Qdrant).
- Backend Architecture: Designing asynchronous, high-load APIs using FastAPI, Celery, Redis, and PostgreSQL with a strict adherence to Repository/Service patterns and Dependency Injection.
- MLOps & NLP: Fine-tuning Transformer models (XLM-RoBERTa), building NER pipelines, and packaging ML cores into containerized, production-ready microservices.
- Description: A production-ready, fully local Retrieval-Augmented Generation (RAG) backend designed for B2B environments requiring strict data privacy. Features asynchronous PDF ingestion, dual-memory architecture (Qdrant + Postgres), background task offloading, and a secure JWT authentication flow with Token Blacklisting.
- Tech Stack: FastAPI, Qdrant, PostgreSQL, Celery, Redis, Ollama, Docker Compose.
- Link: http://github.com/thatwhocode/rag_saas
- Description: Developed and deployed a complete ML system for Named Entity Recognition (NER) based on XLM-RoBERTa for a government contractor. Fine-tuned the model to extract weapon characteristics from unstructured text with 97% accuracy. Designed as a fully on-premise, containerized microservice.
- Tech Stack: Python, PyTorch, Hugging Face, FastAPI, Docker, CI/CD.
- Link: https://github.com/thatwhocode/neuro_app
- Description: Built a custom HTTP server from scratch in C++ to deepen my understanding of low-level networking, memory management, and concurrent connection handling using native Linux Sockets API.
- Tech Stack: C++, Linux API, POSIX Sockets.
- Link: https://github.com/thatwhocode/http_server_cxx
- Email: thatwhocode@gmail.com
- LinkedIn: Stanislav Utkin