Site Reliability Engineer

 

Description:

You are responsible for:

  • Performing day-to-day operational/DevOps tasks on Wikimedia’s public facing infrastructure (deployment, maintenance, configuration, troubleshooting
  • Implementing and utilizing configuration management and deployment tools (Puppet, Kubernetes)
  • Leading continuous improvement, by automating the installation, configuration and maintenance of services on our platform
  • Assisting in the architectural design of new services and making them operate at scale
  • Assisting in or leading incident response, diagnosis, and follow-up on system outages and alerts across Wikimedia’s production infrastructure
  • Share our values and work in accordance with them

Skills and Experience:

  • 2+ years experience in an SRE/Operations/DevOps role as part of a team
  • Experience with operating highly available infrastructure
  • Comfortable with shell and a programming language used in an SRE/Operations engineering context (Python, Go, Ruby, etc.)
  • Experience with package management for operating systems (Debian, etc)
  • Comfortable with Open Source configuration management and orchestration tools (Puppet, Ansible, TerraForm etc.)
  • Past exposure to automation and streamlining of tasks
  • Communicative technical English

Organization Wikimedia Foundation
Industry Engineering Jobs
Occupational Category Engineer
Job Location Riyadh,Saudi Arabia
Shift Type Morning
Job Type Full Time
Gender No Preference
Career Level Intermediate
Experience 2 Years
Posted at 2023-11-07 8:52 am
Expires on 2024-10-20