Site Reliability Engineer Job at Xsolla, Remote

  • Xsolla
  • Remote

Job Description

Requirements
  • Proven experience as a Site Reliability Engineer or in a similar software engineering role in a large-scale production environment (5 to 10 years overall in IT, as Ops or Developer).
  • Proficiency in scripting languages such as Python and Bash. A strong understanding of Go and PHP is a plus.
  • Deep knowledge of monitoring systems such as Datadog, Prometheus, and Grafana.
  • Good understanding of continuous integration/continuous delivery processes and platforms (GitLab preferred). Experience with Helm.
  • Experience with Docker, Kubernetes, or other container orchestration systems.
  • Familiarity with infrastructure automation tools like Terraform.
  • Experience with automation, system administration, and system hardening.
  • Experience with Linux-based infrastructures, Linux/Unix administration.
  • Demonstrated problem-solving skills, particularly debugging and troubleshooting complex software systems. Ability to work under pressure.
  • Excellent communication skills, with the capacity to articulate and solve complex technical problems.
  • Xsolla Technology Stack: Ubuntu, Kubernetes, GitLab, Terraform, Terragrunt, Puppet, Nginx, Google Cloud Platform, Datadog, Prometheus, Grafana, ELK, Zabbix, and Harbor.
Responsibilities
  • Ensure high reliability and availability and meet SLAs, SLOs, and SLIs.
  • Monitor the system for issues and respond to incidents, ensuring quick resolution to maintain high system availability.
  • Drive incident resolution and process improvements to minimize downtime and increase operational transparency.
  • Ensure all key services are measured and monitored, and that they raise alerts when needed.
  • Develop comprehensive monitoring solutions that provide full visibility into the different platform components, using tools and services such as Kubernetes, Datadog, Prometheus, Grafana, and others.
  • Support services before they go live through activities such as capacity planning, monitoring setup, logging, and production readiness reviews.
  • Engage in service capacity planning and demand forecasting, performance analysis, and system tuning.
  • Collaborate with the development teams to enhance the product's operational stability.
  • Build and drive the automation systems that maintain system health.
Education
  • IT professional certifications are not required but are a plus:
  • Certified Kubernetes Administrator or Developer
  • HashiCorp Certifications
  • GCP Certifications

Job Tags

Remote job, Full time
