Current Kubernetes Job Openings

Monitoring and Observability Engineer

Full-time
Remote

This role involves designing, implementing, and managing comprehensive monitoring solutions using Prometheus, Grafana, SNMP-Exporter, Streaming Telemetry, OpenTelemetry, and other related technologies.


Responsibilities

- Design, implement, and manage Prometheus-based monitoring solutions, including configurations and alert rules.

- Develop and maintain interactive and visually appealing Grafana dashboards.

- Configure SNMP modules and jobs to scrape SNMP metrics efficiently across different network technologies.

- Use Git confidently: clone working branches, develop, and commit to the main branch, or follow another workflow while demonstrating a strong command of Git.

- Identify and onboard new metrics from various systems and applications, developing data pipelines for metrics collection and storage.

- Optimize and scale monitoring environments to handle large volumes of metrics and ensure comprehensive monitoring coverage.

- Implement and manage Streaming Telemetry solutions for real-time data collection and monitoring.

- Integrate and manage OpenTelemetry for comprehensive tracing and observability across services.

- Troubleshoot and resolve issues related to data collection, monitoring configurations, and dashboard performance.

- Work with DevOps, development, and operations teams to ensure proper instrumentation of applications and infrastructure.

- Document configurations and procedures, and provide training to team members and stakeholders.

 

Skills

- Familiarity with network monitoring tools and practices.

- Extensive experience with Prometheus and related technologies (Alertmanager, Pushgateway, etc.).

- Strong knowledge of time-series databases and monitoring concepts.

- Proficiency in writing Prometheus queries (PromQL).

- Strong experience with Grafana and its ecosystem.

- Proficiency in creating and managing Grafana dashboards and panels.

- Knowledge of data visualization principles and best practices.

- Familiarity with monitoring and observability tools and practices.

- Strong knowledge of SNMP protocols and network device management.

- Experience with SNMP-Exporter and its integration with Prometheus.

- Strong skills in creating SNMP modules and scrape configs for various network technologies.

- Strong Git experience.

- Strong understanding of metrics and monitoring concepts.

- Experience with metrics collection tools (Prometheus, Telegraf, Collectd, etc.).

- Experience with Streaming Telemetry solutions for real-time monitoring.

- Experience with OpenTelemetry for tracing and observability.

- Familiarity with Linux/Unix systems and scripting languages (Bash, Python).

- Experience with containerization and orchestration tools (Docker, Kubernetes).

 

Qualifications

- Bachelor’s degree in Computer Science, Engineering, or a related field.

- 5+ years of experience in monitoring and observability roles.

- Proficiency in tools such as Prometheus, Grafana, PromQL, Alertmanager, Alert Framework, GitHub, SNMP-Exporter, Streaming Telemetry, and OpenTelemetry (OTel).

- Strong coding and scripting skills.

- Excellent problem-solving abilities and attention to detail.

- Strong communication and teamwork skills.

 

Senior DevOps Engineer

Remote
Full-time

Responsibilities:

- Deploy and maintain infrastructure in Yandex Cloud using Terraform and the CLI.

- Work within the PCI DSS perimeter: comply with its requirements and coordinate with supervisors and information security teams.

- Support managed Kubernetes clusters: Helm, ingress, autoscaling, probes. Manage the application lifecycle.

- Configure and operate CI/CD pipelines based on ArgoCD, GitHub Actions, and Jenkins.

- Implement and maintain monitoring and alerting: Prometheus, VictoriaMetrics, Grafana, Alertmanager.

- Automate configurations and processes using Ansible, Bash, and Python.

- Manage secrets with HashiCorp Vault.

- Support applications written in Ruby and Java.


Requirements:

- Experience with Yandex Cloud: compute, VPC, IAM, KMS, object storage, and more.

- Solid knowledge of Kubernetes: Helm, ingress, network policies, RBAC.

- Production experience with Terraform and Ansible.

- Hands-on experience building and maintaining CI/CD processes.

- Solid Linux skills: administration, debugging, automation.

- Understanding of how WAFs work.

- Experience with Vault or other secret management systems.

- Experience writing scripts in Bash / Python.

- Good understanding of the network stack: TCP/IP, DNS, NAT, load balancing, TLS.


Nice to have:

- Experience with Ruby on Rails and Java applications.

- Familiarity with the observability stack: OpenTelemetry, Fluent Bit.

- Understanding of audit requirements and experience passing compliance audits for PCI DSS, ISO 27001, and others.

- Experience managing infrastructure in a multi-account / multi-cloud environment.


Conditions:

- Remote work, flexible schedule.

- Direct communication with the development, information security, and architecture teams.

- No excessive bureaucracy.

- Competitive salary, negotiable based on experience.

Senior Python Developer

Responsibilities: 

  • Build backend services and solutions for our digital portfolio and digital platforms that enable our customers to collect, process, and visualize data. 
  • Develop reliable microservices for web application backends. 
  • Develop new reusable code and libraries for future use, and ensure the technical feasibility of existing and future backend designs. 
  • Deliver error-free code by testing and working with QA teams to meet quality standards. 
  • Maintain existing software and code. 
  • Work and communicate with team members and product owners to ensure that user requirements are understood and design expectations are met. 


Requirements: 

  • 5+ years of recent hands-on coding and software design experience in Python. 
  • Experience working in containerized environments. 
  • Experience with API use, design, and development, such as RESTful APIs. 
  • Familiarity with database technologies such as MySQL, PostgreSQL, MS SQL, Influx, and Redis. 
  • Familiarity with web frameworks such as Flask and Django. 
  • Familiarity with message brokers such as RabbitMQ. 
  • Familiarity with the WebSocket protocol. 
  • Knowledge of code versioning tools such as Git. 
  • An analytical mindset. 
  • Proven organizational and time management skills. 
  • Curious and willing to learn new technologies. 
  • Comfortable working in an agile environment. 
  • Good communication and teamwork skills; working proficiency in English (written and spoken). 
  • Self-motivating, able to assume responsibility and work autonomously in a professional manner. 
  • Willingness to learn new languages, frameworks, and technologies. 


Nice to have:

  • Experience with Kubernetes. 
  • Experience with Jira and Confluence. 
  • Experience with AWS, Azure, or other cloud platforms. 
  • Experience with the Modbus protocol. 
  • Experience with NumPy. 
  • Experience with Swagger.