Home Dr. Piotr Gryko - Curriculum Vitae
Dr. Piotr Gryko - Curriculum Vitae
Cancel

Dr. Piotr Gryko - Curriculum Vitae

Research-Focused AI Engineer | PhD Experimental Physics
đź“§ Piotr.Gryko@gmail.com
🔗 LinkedIn • GitHub • GitLab
📍 Europe


Professional Summary

Research-focused AI engineer with 12+ years experience bridging academic research and production systems. PhD in Experimental Physics with proven track record of identifying and correcting issues in published research, developing novel AI architectures, and scaling systems processing millions in monthly revenue. Specializes in document processing, computer vision, AI safety, and high-performance distributed systems.


Core Technical Expertise

AI/ML Research: Diffusion Models, Computer Vision, NER, LLM Safety, PyTorch, Signal Processing Production Systems: Microservices, AsyncIO, Docker, High-Performance Computing
Data Engineering: PostgreSQL, Elasticsearch, Vector Databases, Real-time Pipelines
Languages: Python, C/C++, JavaScript/TypeScript, SQL Infrastructure: Modal.com, AWS, Docker Swarm, CI/CD, Monitoring


Professional Experience

R&D Software Engineer | Hypodossier AG | Jun 2023 - Present

Document Processing Platform

  • Architected extraction platform with microservices for PDF processing, text classification, and semantic analysis
  • Developed multi-language document classification using SpaCy NLP for German financial documents
  • Built asynchronous processing architecture using RabbitMQ and AsyncIO for high-throughput processing
  • Client Impact: Serving 10+ banking clients with white-label solutions

AI Research Integration

  • Collaborated with ML scientists to reproduce academic papers for product development
  • Implemented NER using SpaCy and LayoutLLM for financial document analysis

Senior ML Engineer / AI Research Engineer | DevaLogic Projects | 2023-Present

Document Anonymization System

  • Architected production-ready document anonymization using PyTorch, Diffusion Models (VAE/UNet), and NER
  • Research Contribution: Corrected hyperparameter bugs in original DiffUTE paper, achieving significant performance improvements
  • Built enterprise inference engine with thread-safe operations, LRU caching, batch optimization, and comprehensive OCR integration
  • Designed scalable GPU training infrastructure on Modal.com (A100s) with W&B experiment tracking
  • Impact: Featured in talks at EuroPython 2025 and PyCon Lithuania 2025

AI-Powered Document Chat Application

  • Developed full-stack RAG system with self-hosted LLM components for enhanced privacy
  • Implemented asynchronous Django REST API with streaming responses and WebSockets
  • Built ChromaDB-based semantic search with LangChain text processing and SHA-256 deduplication
  • Architecture: React TypeScript frontend, PostgreSQL backend, containerized deployment

Property Suitability ML Pipeline

  • Developed end-to-end ML pipeline for energy efficiency assessment achieving F1 scores up to 0.78
  • Engineered 40+ feature pipeline with geospatial data and NLP-derived insights using NLTK
  • Implemented automated hyperparameter optimization using Optuna with MLflow tracking

Drone Dataset Collection System

  • Built production AI pipeline for automated drone footage processing into YOLO training datasets
  • Performance: AVideo processing with GPU acceleration and quality filtering
  • Developed ensemble YOLO detection system with confidence thresholding and batch optimization
  • Implemented microservices architecture with Kubernetes deployment and comprehensive monitoring

Property Energy Efficiency ML Pipeline | 2024 End-to-end ML system for EPC rating prediction

  • Engineered 40+ features with geospatial and NLP components
  • Achieved F1 score of 0.78 using XGBoost with Optuna optimization
  • Built MLflow experiment tracking with automated hyperparameter tuning

Senior Software Engineer | Qogita | Mar 2022 - Jun 2023

Scaling MVP to $20M/month Revenue

  • Led backend and data engineering operations using Django, Celery, Pandas, Snowflake, MongoDB
  • Migrated payment system from Square to Wise with zero downtime
  • Implemented bi-directional data synchronization between Snowflake and PostgreSQL
  • Built pandas-based microservice for seller stockist data ingestion (Excel, CSV)

R&D Software Engineer | Unipart Digital | Oct 2015 - Jan 2022

Multidisciplinary Technology Leadership

  • Led full-stack development projects combining web development, data analysis, and embedded systems
  • Developed predictive analytics systems using Python, NumPy, Pandas for logistics optimization
  • Managed small engineering teams (3-4 developers) delivering complex technical solutions
  • Built production systems using Django, React, Docker with comprehensive DevOps practices

R&D Software Engineer | ION Geophysical | Jun 2013 - Oct 2015

High-Performance Computing Systems

  • Developed signal processing tools for HPC systems using C++
  • Implemented numerical algorithms optimized for geophysical data processing
  • Integrated Python/NumPy into C++ processing systems for enhanced functionality

Education

PhD in Experimental Physics - Nanotechnology & Biomaterials | Imperial College London | 2007-2012
Research focus: Self-assembling biomaterials for bio-sensing applications, nanotechnology characterization

MSci Physics, First Class Honours University College London 2003-2007

Recent Speaking Engagements

2025 Conferences:

“Anonymization of Sensitive Information in Financial Documents” - EuroPython 2025, PyCon Lithuania 2025, Data Science Summit: ML Edition 2025

2024 Conferences:

PyCon Lithuania 2024 - “Building and Scaling an AI Startup with Async Django” 🎥 Watch Talk

PyCon Lithuania 2024 - “Scaling, Refactoring and fixing a Django MVP for Production” 🎥 Watch Talk


Notable Projects & Research

AI Safety Research - Comprehensive LLM security testing suite for healthcare AI companions
Computer Vision Pipeline - 200-400 FPS autonomous drone detection system with quality filtering
Financial Document Intelligence - Semantic processing platform serving major European banks
Privacy-Preserving ML - Production document anonymization correcting published research flaws


Community Involvement

Founder & Organizer - “AI Code & Coffee Warsaw” tech mentoring meetup Facebook
Open Source Contributor - Active on GitHub and GitLab
Technical Mentor - Supporting Warsaw’s developer community through knowledge sharing


Publications & Research Impact

  • Identified and corrected critical bugs in DiffUTE paper implementation (2024)
  • Featured speaker at major European Python and AI conferences
  • Research focus on practical applications of cutting-edge AI in regulated industries
  • Bridge between academic innovation and commercial viability in AI systems

Available for consulting, speaking engagements, and research collaborations in AI/ML, computer vision, and privacy-preserving systems.