FPT Software is seeking highly skilled and experienced Senior DevOps Engineers to join our dynamic engineering team. In this critical role, you will be instrumental in building, scaling, and maintaining our cloud infrastructure on AWS, ensuring the reliability, security, and performance of our applications and services. You will champion DevOps best practices, drive automation, and collaborate closely with development, security, and operations teams to deliver high-quality software efficiently.
Responsibilities:
• Design, implement, and manage our AWS cloud infrastructure, ensuring scalability, reliability, and security.
• Develop and maintain IaC using tools like Terraform or CloudFormation.
• Implement and manage CI/CD pipelines using tools like Jenkins, GitLab CI, CircleCI, or AWS CodePipeline.
• Establish and maintain comprehensive monitoring, logging, tracing, and alerting systems to proactively identify and resolve issues.
• Manage and orchestrate containerized applications using Kubernetes, ECS, Lambda
• Implement and enforce security best practices across the infrastructure and deployment processes.
• Optimize cloud costs and resource utilization.
• Contribute to the definition and implementation of our branching and release processes.
• Troubleshoot and resolve complex infrastructure and application deployment issues.
• Collaborate effectively with development teams to integrate and deploy applications seamlessly.
• Document infrastructure configurations, processes, and best practices.
• Stay up-to-date with the latest AWS services and DevOps trends.
• Participate in on-call rotations as needed.
Required Skills and Experience:
• Fluency in English communication
• Extensive experience with AWS is a must
• Solid understanding of cloud landing zone concepts and best practices.
• Proven ability to implement and optimize cloud costs.
• Strong understanding of security principles and best practices in cloud environments.
• Significant experience with Kubernetes for container orchestration.
• Proficiency in setting up and utilizing monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack, CloudWatch).
• Experience with tracing and Application Performance Monitoring (APM) tools (e.g., Datadog, New Relic, AWS X-Ray).
• Strong Linux system administration skills.
• Deep understanding of DevOps processes, including CICD, and various branching models (e.g., Gitflow).
• Proficient in using Git for version control and collaboration.
• Strong scripting skills in Shell (Bash, Zsh) and Python.
• Basic coding knowledge in at least one programming language.
• Fundamental understanding of database concepts and different database technologies (e.g., relational, NoSQL).
• Solid understanding of networking principles (TCP/IP, DNS, routing, firewalls).
Nice-to-Have Skills:
• Experience with other cloud platforms such as Google Cloud Platform (GCP) or Microsoft Azure.
• Experience with IaC tools beyond Terraform/CloudFormation (e.g., Ansible, Chef, Puppet).
• Advanced coding skills in one or more programming languages.
• Experience with serverless architectures.
• Familiarity with security scanning and vulnerability management tools.
• Experience with agile development methodologies.