Kirill Chirkunov, Senior Data Scientist

Presight

Location: United Arab Emirates - Abu Dhabi
Education: Doctorate, Engineering, Applied Artificial Intelligence, Multiagent Systems
Experience: 17 years, 10 months


Work Experience

Total years of experience: 17 years, 10 months

Senior Data Scientist at Presight
  • United Arab Emirates - Abu Dhabi
  • My current job since May 2022

Full cycle development of media analytics solution (Azure Cloud, On-Premise), including:
● Data storage design and implementation,
● Training datasets annotation service: LLM - (Azure) ChatGPT 3.5 / 4, Bard / Gemini, Llama 2 70B, Mixtral 8x7B
● ML Models development (topic modeling, entity / relationship extraction, sentiment analysis, text categorization, clustering & profiling, emergent trends detection, media insights, logo detection),
● Applications of existing models: (B)(C)LIP (image search by text), Whisper (speech-to-text), voice actions (pauses and punctuation in speech), speech splitting, YOLOv* (object detection), image captioning, image hashing (similar images search), FastOCR (image to text),
● Services development such as text analysis, speech analysis, video & image analysis, OCR, trends, profiling,
● Long-running jobs development,
● Data pipelines development: scalable data collection / cleansing / transformation / analysis of text content / images / audio / video, sync / async data processing,
● Automation of CI / CD pipeline for smooth updates,
● Monitoring: Azure Monitor, Prometheus, Logstash, Kibana / Elasticsearch,
● Dashboard development and visualization (Kibana, OpenSearch).

Agent development based on large language models (ChatGPT, Mixtral 8x7B, FastChat/OpenAI compatible, LangChain, LlamaIndex):
● Prompt templates + Schemas,
● Orchestrator, Function calls and Tools development (OpenAPI).

Tech Lead | Senior ML Engineer (cooperation with IBM Research) at Gazpromneft-NTC
  • Russian Federation - Saint Petersburg
  • May 2018 to May 2022

Leading a project devoted to the development of intelligent geo tools software.
● Developed scalable ELT pipelines based on language models to extract key attributes from plain text core descriptions.
● Engineered a detection system for missing well-log series estimation.
● Implemented computer vision algorithms and ML-models for the detection of geological features in raw seismic data.
● Developed a 1D probabilistic model for sedimentary environments in cooperation with IBM Research; individually developed a 2D probabilistic visual model for spatial sedimentary facies and a 2D stochastic model for channels.
● Implemented 2D facies map restoration model (GANs).
● Designed and deployed an integrated architecture for managing a seismic big data pipeline.
● Directed geographically distributed (DS + Product) teams, ensuring effective collaboration and productivity.
● Organized and managed the full development cycle, from conceptualization to deployment, on On-Premise enterprise infrastructure and IBM Cloud Foundry.

Leading the VLab project: a booking platform for core and fluid research maintenance.
● 130+ pieces of laboratory equipment and complex measurements.
● Reduced the time for the full research-booking cycle by up to 30%.
● System design of key product features.
● Implementation of User Analytics modules.
● Developed a regularly populated knowledge database for complex core/fluid experiments and measurements (data mining techniques).
● Developed ML/CV models for rock sample classification and quality estimation using PS-waves, X-ray and photo images (ResNet-50, ResNeXt).
● Directed a team of software developers, ensuring effective collaboration and productivity.

Math & Stat Analysis, Signal Processing, Natural Language Processing in GeoExploration
Python / Scala / Java / TensorFlow / Keras / PyTorch
RDBMS / PostgreSQL / NoSQL / MongoDB / MinIO / S3 / Apache Jena / Knowledge Bases
Apache Spark / HDFS / Hive / Apache Airflow
BigQuery / BigTable
Scalable DevOps / Cloud MLOps pipeline

Ph.D., Assistant Professor at Novosibirsk State University
  • Russian Federation - Novosibirsk
  • February 2012 to May 2018

As an assistant professor, taught computer science disciplines to master's and undergraduate students, and managed and supervised master's projects (55+ graduates).
● Developed a half-year “Tools and Languages for Big Data Analysis” study course
● Developed a half-year “Advanced Mobile Apps Development” study course
● Taught the “System Design & Architecture” study course
● Invented complex algorithms for traffic management within the Smart Cities scientific initiative based on multiagent reinforcement learning
Skills: ML Programming · Big Data · Problem Solving · Big Data Analytics · Data Engineering · Deep Learning · Cloud Computing · Python Programming · Machine Learning

Senior Software Developer | Senior Machine Learning Engineer at Nyxale
  • Russian Federation - Moscow
  • January 2017 to May 2018

Design and engineering of high-load anti-fraud systems for the telecom and financial industries.
● Concentrated on fast anomaly detection algorithms within OLTP systems and data streaming.
● Significantly reduced the number of false-positive detections.
● Developed detection templates and an internal script language for security specialists.
● Reduced the software delivery cycle time: organized automated testing and created a transparent DevOps and MLOps pipeline.

Languages: Python / R, C#, T-SQL
Tools & Technologies & Methods: Machine Learning programming, XGBoost & exGB, YOLO (2), imbalanced learning, PyTorch/Keras/TensorFlow, RabbitMQ, Redis, Firebase, MS SQL, PostgreSQL

Tech Lead | DW / BI / BigData Senior Developer at Melzdrav
  • Russian Federation - Moscow
  • May 2013 to December 2016

Leading development of enterprise software solutions for all major business departments of a pharmacy retailer (sales, marketing, business partnerships, financial service, IT service).
▪ Management of outsourced product teams.
▪ DevOps pipeline construction.
▪ Writing technical specifications.
▪ Design and engineering of enterprise apps.
▪ Design, engineering, development and maintenance of data warehouses (BigData, ETL, OLTP, OLAP, Reports).
▪ Design, engineering, development and maintenance of a distributed real-time retail data exchange and analytics system.
▪ Programming of math and stat algorithms for supply chain optimization.
▪ Predictive analytics for sales and marketing departments.

Technologies: MS SQL, PostgreSQL, MySQL, DW/BI, SSAS Data Mining, Big Data, SSIS, SSAS OLAP Dimension modeling, Applied AI technologies, Sharepoint / ASP.NET / MS Reporting Services, RabbitMQ / Kafka, Apache Hadoop, Apache Spark, Hive, MapReduce, Pig, AWS Services
Languages/scripts: C# / MDX / DMX / XMLA / T-SQL / Python / Java
Skills: ML Programming · Web Development · Team Leadership · Big Data · Software Engineering · Java · Problem Solving · Big Data Analytics · Project Management · OLAP · Web Services · C# · Data Engineering · SQL · Data Visualization · NoSQL · Python Programming · Machine Learning · Scrum · Software Development

Senior Software Engineer (Microsoft Azure Team contractor) at EastBanc Technologies
  • Russian Federation - Novosibirsk
  • January 2012 to April 2013

Development of an Azure-based SaaS system for professional Web API management

Responsibilities:
- Automation of scaling for high-load web services,
- Development of a role-based security system for shared access,
- Code refactoring,
- Design and development of a system for handling exceptional scenarios,
- Integration, unit and stress tests,
- Operational issue/bug fixing,
- Design of a hierarchical backup strategy,
- Web UI programming,
- and others.

Languages: C# / LINQ / JavaScript / CSS 2.0+3.0 / XML / T-SQL / PowerShell & MSBuild script languages.
Technologies and Libraries: ASP MVC 4.5 / IIS / Windows Azure (including SQL Azure) / Web Services + REST API / jQuery & Knockout / Entity Framework / WASABI.
Software: Visual Studio 2010 / Visual Studio 2012, Team Foundation Server.
Type of process management: Kanban / Scrum
Skills: Web Development · Big Data · Software Engineering · Problem Solving · Microsoft Azure · Web Services · C# · Data Engineering · Cloud Computing · Scrum · Software Development

Software engineer at GrapeCity
  • United States - Pittsburgh
  • January 2011 to December 2011

Responsibilities:
- Technical documentation (specs, software design & architecture),
- Software development,
- Code reviews,
- Design & development of rendering engine RDL(X) - XAML,
- Unit tests,
- Issue/bug fixing,
- Managing junior programmers,
- Project task assessment,
- Research of product performance and new technologies.

Languages: C#, XAML, ActionScript (Flash).
Technologies & Libraries: Windows Forms / WPF / Silverlight 4, 5b / Flex, NUnit, MSBuild.
Software: Visual Studio 2008 / Visual Studio 2010 / Adobe Flash Builder / Blend, Mercurial, FogBugz.
Type of process management: Scrum / TDD
Skills: Web Development · Big Data · Software Engineering · Java · Kanban · Web Services · C# · SQL · NoSQL · Scrum · Software Development

Software Engineer at Sitronics Telecom Solutions
  • Russian Federation - Moscow
  • August 2006 to December 2010

Development of software for monitoring and statistics collection (telecom hardware)

Responsibilities:
- Technology reviews and product proposals,
- Technical documentation (user manuals, software design & architecture documents, references),
- Software design and programming of system modules,
- UI design,
- Writing code,
- Software design and implementation of a statistics collection system (via SNMP v1/v2/v3),
- System profiling and performance tests,
- Code analysis,
- Issue/bug fixing,
- Integration, stress and unit tests,
- Refactoring, code optimization and profiling,
- Deployments and system configuration on the customer side.

Languages: C#, XAML/XML/XSLT/XSD.
Technologies & Libraries: Windows Forms / WPF / .NET Addins, .NET Remoting, Enterprise Library, SNMP v1/v2/v3.
Software: Visual Studio 2005 / Visual Studio 2008 / HP Quality Center / Subversion / TFS, NUnit, MSBuild.
Skills: Big Data · Software Engineering · Web Services · C# · Scrum · Software Development

Education

Doctorate, Engineering, Applied Artificial Intelligence, Multiagent Systems
  • at Ministry of Education and Science of the Russian Federation
  • December 2012

Graduated with excellence

Specialties & Skills

Team Management
Machine Learning
Computer Vision
Natural Language Processing
Natural Language Understanding
Cloud Computing
Data Science
Semantic search
Big Data
Large Language Models
Data Engineering
Software Design
Project Management
Frontend Programming
Predictive analytics

Languages

English: Expert
Russian: Native Speaker

Memberships

IEEE
  • Participant
  • March 2012
ACM
  • Participant
  • August 2020
EAGE
  • Participant
  • May 2021

Training and Certifications

Google Cloud Certified Professional Data Engineer (Certificate)
Date Attended: April 2011
AWS Certified Data Analytics – Specialty (Certificate)
Date Attended: July 2020
Big Data Analysis with Scala and Spark (Certificate)
Date Attended: October 2020
Generative AI with Large Language Models (Certificate)
Date Attended: September 2023