Responsibilities:
- You will serve as the primary point of contact for issues within our clients' system and infrastructure domain.
- You are responsible for monitoring and ensuring the smooth operation of our clients' cloud environments.
- You will devise solutions for high-priority and essential business problems.
- Collaborate with a diverse range of software and engineering groups to bolster processes in their current platform.
- Provide troubleshooting expertise and be part of the on-call team to address urgent system outages alongside client teams.
- Conduct in-depth analysis to determine root causes, pinpoint recurring problems, and aim to create automated resolutions for them.
- Integrate closely with teams emphasizing development, quality assurance, performance, reliability, infrastructure, safety, and regulatory compliance.
- Your role involves an 8-hour shift rotation, ensuring continuous 24x7 support.
Qualifications:
- Holds a bachelor’s degree or an advanced educational qualification.
- Proficient in managing Linux systems, including Ubuntu and RHEL.
- A minimum of 3 years of hands-on experience with:
- Cloud Platforms: Notably AWS managed services.
- Infrastructure as Code (IaC) solutions: Such as Hashicorp Terraform, CloudFormation, Chef, Ansible, Puppet, among others.
- Build/Release Processes: Knowledge in Git/Github/Gitlab, Artifactory, CI/CD tools like Jenkins, TeamCity, Spinnaker, and build utilities like Maven and Gradle.
- Containerization Technologies: Expertise in Kubernetes, Docker, Hashicorp Nomad, EKS, and ECS.
- Logging & Monitoring: Familiarity with Prometheus, Thanos, Grafana, Splunk, and AWS CloudWatch.
- Has supported infrastructure in agile settings for at least 3 years.
- Capable of crafting automation scripts utilizing Bash, Python, or Go.
- Solid understanding of DevOps methodologies and associated tools.
- Strong communication abilities, both orally and in writing.
Plus points but not requirement:
- Holds certifications pertinent to the mentioned key qualifications.
- Skilled in setting up reverse proxy servers, including Envoy and NGINX.
- Adept in Networking, encompassing TCP/IP, Security, tools like HSM, Netscaler, IPtables, and network routing.
- Familiar with service mesh technologies, especially Istio and Consul.
- Has hands-on experience with both SQL and NoSQL database systems, such as Oracle and Cassandra.