Senior SRE/DevOps Engineer - Trading Systems (AWS, Kubernetes)
Key Responsibilities
- Monitor and troubleshoot production trading systems, identifying performance bottlenecks and service interaction issues, and creating actionable tickets for development teams.
- Conduct thorough incident resolution with comprehensive root cause analysis and detailed reporting, collaborating across teams to solve complex infrastructure challenges.
- Design and implement monitoring solutions using Zabbix, Grafana, Dynatrace, and the ELK stack to provide real-time visibility into system health.
- Establish and maintain robust build, release, and configuration management processes that ensure consistency and reliability across environments.
- Deploy, automate, and manage AWS cloud-based production systems with an emphasis on high availability, performance optimization, scalability, and enterprise-grade security.
- Implement and orchestrate Kubernetes clusters (v1.28+) for containerized applications with a focus on resilience and horizontal scalability.
- Develop and refine CI/CD pipelines using Jenkins 2.x and GitLab CI to streamline deployment workflows and reduce time-to-production.
- Configure and manage Infrastructure as Code using Terraform and Terragrunt for consistent, version-controlled infrastructure deployments.
- Administer development, QA, and production environments throughout the entire software development lifecycle.
Required Skills
- Strong knowledge of Linux/Unix systems (Ubuntu 22.04, CentOS 8, or equivalent) with demonstrated troubleshooting expertise.
- At least 3 years of hands-on experience with containerization technologies, including Docker and Kubernetes orchestration.
- Demonstrated proficiency with Infrastructure as Code and configuration management tools such as Terraform, Ansible, Chef, or Puppet.
- Comprehensive understanding of web server architecture and configuration, especially Nginx and reverse proxy implementations.
- In-depth knowledge of the HTTP protocol and RESTful service architectures.
- Practical experience implementing and maintaining CI/CD pipelines using Jenkins and/or GitLab CI.
- Advanced proficiency with Git version control and GitOps workflows.
- Working knowledge of SQL and experience with PostgreSQL (v14+) or MySQL (v8+).
- Fundamental understanding of networking concepts including TCP/IP, DNS, load balancing, and security principles.
