Kirill Chirkunov, Senior Data Scientist

Presight

Country
United Arab Emirates - Abu Dhabi
Education
Doctorate, Engineering, Applied Artificial Intelligence, Multiagent Systems
Experience
17 years, 10 months


Work Experience

Total years of experience: 17 years, 10 months

Senior Data Scientist at Presight
  • United Arab Emirates - Abu Dhabi
  • In this position since May 2022

Full-cycle development of a media analytics solution (Azure Cloud, on-premise), including:
● Data storage design and implementation,
● Training dataset annotation service using LLMs: (Azure) ChatGPT 3.5 / 4, Bard / Gemini, Llama 2 70B, Mixtral 8x7B,
● ML model development (topic modeling, entity / relationship extraction, sentiment analysis, text categorization, clustering & profiling, emergent trend detection, media insights, logo detection),
● Application of existing models: BLIP / CLIP (image search by text), Whisper (speech-to-text), voice actions (pauses and punctuation in speech), speech splitting, YOLOv* (object detection), image captioning, image hashing (similar-image search), FastOCR (image to text),
● Service development for text analysis, speech analysis, video & image analysis, OCR, trends, and profiling,
● Long-running job development,
● Data pipeline development: scalable data collection / cleansing / transformation / analysis of text content / images / audio / video, sync / async data processing,
● Automation of the CI / CD pipeline for smooth updates,
● Monitoring: Azure Monitor, Prometheus, Logstash, Kibana / Elasticsearch,
● Dashboard development and visualization (Kibana, OpenSearch).

Agent development based on large language models (ChatGPT, Mixtral 8x7B, FastChat / OpenAI-compatible, LangChain, LlamaIndex):
● Prompt templates + Schemas,
● Orchestrator, Function calls and Tools development (OpenAPI).

Tech Lead | Senior ML Engineer (cooperation with IBM Research) at Gazpromneft-NTC
  • Russia - Saint Petersburg
  • May 2018 to May 2022

Led a project devoted to intelligent geo tools software development.
● Developed scalable ELT pipelines based on language models to extract key attributes from plain text core descriptions.
● Engineered a detection system for missing well-log series estimation.
● Implemented computer vision algorithms and ML-models for the detection of geological features in raw seismic data.
● Developed a 1D probabilistic model of the sedimentary environment in cooperation with IBM Research; individually developed a 2D probabilistic visual model for spatial sedimentary facies and a 2D stochastic model for channels.
● Implemented 2D facies map restoration model (GANs).
● Designed and deployed an integrated architecture for managing a seismic big data pipeline.
● Directed geographically distributed (DS + Product) teams, ensuring effective collaboration and productivity.
● Organized and managed the full development cycle, from conceptualization to deployment, on on-premise enterprise infrastructure and IBM Cloud Foundry.

Led the VLab project: a booking platform for core and fluid research maintenance.
● 130+ pieces of laboratory equipment and complex measurements.
● Reduced the time for the full research booking cycle by up to 30%.
● System design of key product features.
● Implementation of User Analytics modules.
● Developed a regularly populated knowledge database for complex core/fluid experiments and measurements (data mining techniques).
● Developed ML/CV models for rock sample classification and quality estimation using PS-waves, X-ray and photo images (ResNet-50, ResNeXt).
● Directed a team of software developers, ensuring effective collaboration and productivity.

Math & Stat Analysis, Signal Processing, Natural Language Processing in GeoExploration
Python / Scala / Java / TensorFlow / Keras / PyTorch
RDBMS / PostgreSQL / NoSQL / MongoDB / MinIO / S3 / Apache Jena / Knowledge Bases
Apache Spark / HDFS / Hive / Apache Airflow
BigQuery / Bigtable
Scalable DevOps / Cloud MLOps pipeline

Ph.D., Assistant Professor at Novosibirsk State University
  • Russia - Novosibirsk
  • February 2012 to May 2018

As an assistant professor, taught computer science disciplines to master's and undergraduate students; managed and supervised master's projects (55+ graduates).
● Developed the half-year “Tools and Languages for Big Data Analysis” study course
● Developed the half-year “Advanced Mobile Apps Development” study course
● Taught the “System Design & Architecture” study course
● Invented complex algorithms for traffic management within the Smart Cities scientific initiative based on multiagent reinforcement learning
Skills: ML Programming · Big Data · Problem Solving · Big Data Analytics · Data Engineering · Deep Learning · Cloud Computing · Python Programming · Machine Learning

Senior Software Developer | Senior Machine Learning Engineer at Nyxale
  • Russia - Moscow
  • January 2017 to May 2018

Design and engineering of high-load anti-fraud systems for the telecom and financial industries.
● Concentrated on fast anomaly detection algorithms within OLTP systems and data streaming.
● Significantly reduced the number of false-positive detections.
● Developed detection templates and an internal scripting language for security specialists.
● Reduced the software delivery cycle time: organized automated testing, created a transparent DevOps and MLOps pipeline.

Languages: Python / R, C#, T-SQL
Tools & Technologies & Methods: Machine Learning programming, XGBoost & exGB, YOLO (2), imbalanced learning, PyTorch / Keras / TensorFlow, RabbitMQ, Redis, Firebase, MS SQL, PostgreSQL

Tech Lead | DW / BI / BigData Senior Developer at Melzdrav
  • Russia - Moscow
  • May 2013 to December 2016

Led development of enterprise software solutions for all major business departments of a pharmacy retailer (sales, marketing, business partnerships, financial service, IT service).
▪ Management of outsourced product teams.
▪ DevOps pipeline construction.
▪ Writing technical specifications.
▪ Design and engineering of enterprise apps.
▪ Design, engineering, development and maintenance of data warehouses (BigData, ETL, OLTP, OLAP, Reports).
▪ Design, engineering, development and maintenance of a distributed real-time retail data exchange and analytics system.
▪ Programming of math and stat algorithms for supply chain optimization.
▪ Predictive analytics for sales and marketing departments.

Technologies: MS SQL, PostgreSQL, MySQL, DW/BI, SSAS Data Mining, Big Data, SSIS, SSAS OLAP Dimension modeling, Applied AI technologies, SharePoint / ASP.NET / MS Reporting Services, RabbitMQ / Kafka, Apache Hadoop, Apache Spark, Hive, MapReduce, Pig, AWS Services
Languages/scripts: C# / MDX / DMX / XMLA / T-SQL / Python / Java
Skills: ML Programming · Web Development · Team Leadership · Big Data · Software Engineering · Java · Problem Solving · Big Data Analytics · Project Management · OLAP · Web Services · C# · Data Engineering · SQL · Data Visualization · NoSQL · Python Programming · Machine Learning · Scrum · Software Development

Senior Software Engineer (Microsoft Azure Team contractor) at EastBanc Technologies
  • Russia - Novosibirsk
  • January 2012 to April 2013

Development of an Azure-based SaaS system for professional Web API management

Responsibilities:
- Automation of scaling for high-load web services,
- Development of a role-based security system for shared access,
- Code refactoring,
- Design and development of a system for handling exceptional scenarios,
- Integration, unit and stress tests,
- Operational issue resolution and bug fixing,
- Design of a hierarchical backup strategy,
- Web UI programming,
- and others.

Languages: C# / LINQ / JavaScript / CSS 2.0+3.0 / XML / T-SQL / PowerShell & MSBuild script languages.
Technologies and Libraries: ASP MVC 4.5 / IIS / Windows Azure (including SQL Azure) / Web Services + REST API / jQuery & Knockout / Entity Framework / WASABI.
Software: Visual Studio 2010 / Visual Studio 2012, Team Foundation Server.
Type of process management: Kanban / Scrum
Skills: Web Development · Big Data · Software Engineering · Problem Solving · Microsoft Azure · Web Services · C# · Data Engineering · Cloud Computing · Scrum · Software Development

Software Engineer at GrapeCity
  • United States - Pittsburgh
  • January 2011 to December 2011

Responsibilities:
- Technical documentation (specs, software design & architecture),
- Software development,
- Code reviews,
- Design & development of rendering engine RDL(X) - XAML,
- Unit tests,
- Issue/bug fixing,
- Managing junior programmers,
- Project task assessment,
- Research on product performance and new technologies.

Languages: C#, XAML, ActionScript (Flash).
Technologies & Libraries: Windows Forms / WPF / Silverlight 4, 5b / Flex, NUnit, MSBuild.
Software: Visual Studio 2008 / Visual Studio 2010 / Adobe Flash Builder / Blend, Mercurial, FogBugz.
Type of process management: Scrum / TDD
Skills: Web Development · Big Data · Software Engineering · Java · Kanban · Web Services · C# · SQL · NoSQL · Scrum · Software Development

Software Engineer at Sitronics Telecom Solutions
  • Russia - Moscow
  • August 2006 to December 2010

Development of software for monitoring and statistics collection (telecom hardware)

Responsibilities:
- Technology reviews and product proposals,
- Technical documentation (user manuals, software design & architecture documents, references),
- Software design and programming of system modules,
- UI design,
- Writing code,
- Software design and implementation of a statistics collection system (via SNMP v1/v2/v3),
- System profiling and performance tests,
- Code analysis,
- Issue/bug fixing,
- Integration, stress and unit tests,
- Refactoring, code optimization & profiling,
- Deployments and system configuration on the customer side.

Languages: C#, XAML/XML/XSLT/XSD.
Technologies & Libraries: Windows Forms / WPF / .NET Addins, .NET Remoting, Enterprise Library, SNMP v1/v2/v3.
Software: Visual Studio 2005 / Visual Studio 2008 / HP Quality Center / Subversion / TFS, NUnit, MSBuild.
Skills: Big Data · Software Engineering · Web Services · C# · Scrum · Software Development

Educational Background

Doctorate, Engineering, Applied Artificial Intelligence, Multiagent Systems
  • at Ministry of Education and Science of the Russian Federation
  • December 2012

Graduated with excellence

Specialties & Skills

Team Management
Machine Learning
Computer Vision
Natural Language Processing
Natural Language Understanding
Cloud Computing
Data Science
Semantic search
Big Data
Large Language Models
Data Engineering
Software Design
Project Management
Frontend Programming
Predictive analytics

Languages

English
Expert
Russian
Native language

Memberships

IEEE
  • Participant
  • March 2012
ACM
  • Participant
  • August 2020
EAGE
  • Participant
  • May 2021

Training and Certifications

Google Cloud Certified Professional Data Engineer (Certificate)
Course date:
April 2011
AWS Certified Data Analytics – Specialty (Certificate)
Course date:
July 2020
Big Data Analysis with Scala and Spark (Certificate)
Course date:
October 2020
Generative AI with Large Language Models (Certificate)
Course date:
September 2023