Job Description

Kyiv, Ukraine

Senior Site Reliability Engineer

Skelia invites a Senior Site Reliability Engineer for long-term, full-time employment at its service center in Kyiv, Ukraine. This is a unique opportunity to join the extension team of a UK software company.

Mission

Utilize your knowledge of Web, App, Network, Server, Storage and Security technologies to administer, monitor and troubleshoot application and network components in our cloud-based environment

Create and monitor dashboards and alerts for key infrastructure metrics and business KPIs that relate to site reliability

Provide production support (L2) for Customer Support and Product Development teams through triage, troubleshooting, and remediation of incidents and problems

Respond to monitoring alerts in a rotating on-call team environment

Analyze complex system behavior and resolve performance and application issues

Identify and remediate gaps in metrics collection, monitoring, alerting, dashboarding and trending

Develop your skills in the DevSecOps field

Actively contribute to initiatives aimed at platform resilience, performance, and scalability

Improve our build and deployment processes

Requirements

1-2 years of hands-on deployment and operations experience with Amazon Web Services and/or Azure Cloud Services

Deep knowledge of AWS (EC2, ECS, Fargate, S3, CloudFront, RDS, Security Groups, ELB, ElastiCache) or the relevant Azure services (Azure DevOps, Key Vault)

Experience with infrastructure as code and configuration automation tools (Terraform preferred)

Strong understanding of networking principles and architectures

Knowledge of and experience with database administration

Container and container orchestration experience

Experience with CI infrastructure and software development workflow automation tools, such as GitHub Actions and Jenkins

Experience using Git

Experience with monitoring tools such as Grafana, Prometheus, and Datadog

Experience in keeping services up 24/7

Ability to define actionable monitoring and alerting for systems

Deep knowledge of Linux internals and administration

Strong teamwork and communication skills

Demonstrated ability to clearly document and communicate technical issues

High level of flexibility; ability and willingness to work the shifts required to support a 24/7 operational model

Upper-intermediate or higher level of English

In your resume, please include your consent for our company to use your personal data.
