Site Reliability Engineer
Posted bythe hiring team· 2 days ago
- Location
Posted bythe hiring team· 2 days ago
Site Reliability Engineer
USD 66,023 – USD 75,531
Be among the first applicants
Verified team
HR-vetted before going live.
Transparent pay
Salary stated upfront.
Be among the first applicants
Just opened — your application stands out.
About this role
(#4479)
We are looking for an experienced Site Reliability Engineer to ensure the stability, scalability, and operational excellence of a Kubernetes-based platform running in a hybrid environment.
The project is entering a pivotal phase, with a major go-live planned for mid-February and a target audience of 75,000 users. User onboarding is already underway, with over 5,000 users connected and 15,000–20,000 expected to be active by year-end. While the system is stable, we anticipate increased activity and new challenges in January, February, and after the go-live—making this an exciting opportunity to make a real impact. The role focuses on performance optimization, scaling strategies, observability, and reliability engineering.
Required Skills:
4+ years of experience as SRE / DevOps Engineer
Strong hands-on experience with Kubernetes in production
Experience working with hybrid infrastructure (on-prem + cloud)
Solid knowledge of PostgreSQL performance tuning and scaling
Experience with Qdrant or other vector databases
Experience with CI/CD workflows, Helm, Kubernetes autoscaling, and resource optimization
Familiarity with observability stacks (Prometheus, Grafana, ELK/Loki)
Understanding of performance engineering and load testing
Experience with Linux systems and networking
Strong troubleshooting and incident-management skills
Strong Python skills; Rust exposure is a plus
Strong experience with infrastructure as code (Terraform)
Nice to Have:
Experience with STACKIT or other sovereign clouds
Experience with PgBouncer
Knowledge of SRE practices (SLO/SLI)
Experience in regulated or public-sector environments
German language skills
Responsibilities:
Operate and optimize hybrid infrastructure (on-prem & STACKIT)
Manage and scale Kubernetes clusters
Optimize Helm charts, resource usage, and autoscaling
Conduct performance, load, and stress testing
Ensure reliability, availability, and monitoring of production systems
Tune and operate PostgreSQL
Operate and optimize vector databases (e.g. Qdrant)
Implement monitoring, logging, and alerting
Support incident response and capacity planning
We offer:
Flexible working format - remote, office-based or flexible
A competitive salary and good compensation package
Personalized career growth
Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
Active tech communities with regular knowledge sharing
Education reimbursement
Memorable anniversary presents
Corporate events and team buildings
Other location-specific benefits
The Site Reliability Engineer role with the hiring team offers USD 66,023–75,531 per year. Salary information is published as part of every JobRemotely listing so candidates can self-screen before applying.
Yes — the hiring team has marked this Site Reliability Engineer role as open to candidates based in Poland. Eligibility requirements are surfaced in the JobPosting structured data on the listing.
The hiring team uses the JobRemotely structured hiring pipeline: candidates apply through the listing, complete a paid test task or screening, and only then proceed to interviews. This skips the resume black hole and respects everyone's time.
Similar roles
Hand-picked from the same category.
the hiring team· San Francisco·Remote·2 days ago
USD 216,000 – USD 329,400
Viewthe hiring team· San Francisco·Remote·2 days ago
USD 211,000 – USD 234,000
Viewthe hiring team· San Francisco·Remote·2 days ago
USD 216,000 – USD 240,000
Viewthe hiring team· San Francisco·Remote·2 days ago
USD 347,700 – USD 385,000
View