Registered: 22.09.2022

Alexander Egorenkov

IT
middle
Specialization: Big Data Solution Architect and Engineer

Portfolio

NewYorker

- Design and implementation of data pipelines for the Data Lake and for BI applications.
- Migration to a Data Lake based architecture.
- Analysis of post-migration data and alignment with pre-migration data.
- Code review, CI/CD support, pipeline monitoring.

Accenture

● Products Industry Client, Germany, D&A Hub (BI, AI)
Responsibilities: Design and implementation of data pipelines for ML tasks in AWS Cloud; Data Science support
Technologies: AWS EMR, Jenkins, Spark, PySpark, Python, R, S3, Hive, Jupyter notebook
● Products Industry Client, Germany, Big Data Aftersales (Data Lake ETL)
Responsibilities: Data Lake design and implementation; updates, testing, deployment; local team lead / tech lead
Technologies: Hadoop, Hive, Spark, Oozie, AWS Glue, Athena, Terraform
● Global Telecom Client, Europe (data warehouse database development)
Responsibilities: Design of data preparation workflows for the DWH
Technologies: AWS, Redshift, Matillion, Aginity, DataRow, SQL
● Global Online Tech Client, IDS (Intelligent Datasets development)
Responsibilities: Dataset design
Technologies: AWS, Snowflake, Lambda, Glue, Python, SQL
● Products Industry Client, product quality check application (AI-based object recognition for product quality checks)
Responsibilities: Choosing and adapting NN models, dataset creation, model training, backend server deployment
Technologies: Computer Vision, Python, TensorFlow, Keras
● Liquid Studio, Smart Traffic (AI-based traffic control for city road systems)
Responsibilities: NN model adaptation, model training
Technologies: Reinforcement learning, Python, TensorFlow
● Data & Analytics, Krypton (Big Data test and training environment)
Responsibilities: VMware ESXi, Cloudera cluster deployment and configuration
Technologies: VMware, Cloudera, Spark, Kafka

ElektroTechCompany

My team designed and implemented a line of city-level automated systems for control and commercial accounting of heating and water supply, including new industrial controllers and an online portal for collecting, processing, and visualizing information.

Skills

Airflow
Databricks
Hive
PySpark
Python
Redshift
Scala
Snowflake
Spark
SQL

Work experience

Senior Data Engineer
12.2021 - 06.2022 |NewYorker
Cloudera, Airflow, Spark, PySpark, Impala, MS SQL DB
- Design and implementation of data pipelines for the Data Lake and for BI applications.
- Migration to a Data Lake based architecture.
- Analysis of post-migration data and alignment with pre-migration data.
- Code review, CI/CD support, pipeline monitoring.
Team Lead
03.2020 - 11.2021 |Accenture
AWS EMR, Jenkins, Spark, PySpark, Python, R, S3, Hive, Jupyter notebook, VMware, Cloudera, Kafka
● Products Industry Client, Germany, D&A Hub (BI, AI)
Responsibilities: Design and implementation of data pipelines for ML tasks in AWS Cloud; Data Science support
Technologies: AWS EMR, Jenkins, Spark, PySpark, Python, R, S3, Hive, Jupyter notebook
● Products Industry Client, Germany, Big Data Aftersales (Data Lake ETL)
Responsibilities: Data Lake design and implementation; updates, testing, deployment; local team lead / tech lead
Technologies: Hadoop, Hive, Spark, Oozie, AWS Glue, Athena, Terraform
● Global Telecom Client, Europe (data warehouse database development)
Responsibilities: Design of data preparation workflows for the DWH
Technologies: AWS, Redshift, Matillion, Aginity, DataRow, SQL
● Global Online Tech Client, IDS (Intelligent Datasets development)
Responsibilities: Dataset design
Technologies: AWS, Snowflake, Lambda, Glue, Python, SQL
● Products Industry Client, product quality check application (AI-based object recognition for product quality checks)
Responsibilities: Choosing and adapting NN models, dataset creation, model training, backend server deployment
Technologies: Computer Vision, Python, TensorFlow, Keras
● Liquid Studio, Smart Traffic (AI-based traffic control for city road systems)
Responsibilities: NN model adaptation, model training
Technologies: Reinforcement learning, Python, TensorFlow
● Data & Analytics, Krypton (Big Data test and training environment)
Responsibilities: VMware ESXi, Cloudera cluster deployment and configuration
Technologies: VMware, Cloudera, Spark, Kafka
Project Manager
11.2014 - 09.2016 |ElektroTechCompany
Led the engineering department that provided technical solutions for managing Central Heating (CH) and Domestic Hot Water (DHW) systems.
My team designed and implemented a line of city-level automated systems for control and commercial accounting of heating and water supply, including new industrial controllers and an online portal for collecting, processing, and visualizing information.

Education

Applied mathematics and physics (Master's)
1992 - 1994
Moscow Institute of Physics and Technology State University
Applied mathematics and physics (Bachelor's)
1988 - 1992
Moscow Institute of Physics and Technology State University

Languages

Russian: Native
English: Fluent