Job Opportunity
We are seeking a skilled and motivated engineer to fill a critical role within our organization.
- Develop and implement monitoring and alerting systems to quickly detect and respond to incidents.
- Lead incident response efforts, including root cause analysis, mitigation, and post-mortem reporting.
- Analyze system performance and identify areas for improvement.
Required Qualifications :
Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience.3+ years of experience in a similar role, preferably in a SaaS environment.Strong knowledge of infrastructure as code (IaC) tools such as Terraform.Preferred Skills :
Proficiency in Google Cloud Platform (GCP).Experience with containerization and orchestration tools like Docker and Kubernetes.Experience with monitoring and logging tools such as Datadog.Fundamental Knowledge :
Familiarity with CI / CD pipelines and tools like Google Cloud Build, and GitHub Actions.Proficiency in scripting languages such as Python, Bash.Knowledge of Redis and MongoDB.Attention to detail and a commitment to reliability and quality.