Site Reliability Engineer (Aviation)

Przegląd oferty

Lokalizacja
Remote, Remote
Rodzaj pracy
Pełny etat
Wynagrodzenie netto
25,200 zł - 28,560 zł Za MiesiÄ…c
Data opublikowania
3 lat temu

Szczegóły

ID oferty
3751
Typ pracy
Zdalnie
Rozmiar firmy
ponad 200
Wynagrodzenie
Umieszczone w ofercie
Wykorzystywane technologie
PowerShell, Azure, Azure DevOps, CI/CD, Linux, Windows, Kubernetes, Docker, RabbitMQ
Typ umowy
B2B
Rekrutacja
Online
Rekrutacja w języku
Polski Angielski
Korzyści
Prywatna opieka medyczna, Spotkania integracyjne,
Poziom doświadczenia
Senior
Wymagana
Inny

Opis oferty

  • Hands-on experience and extensive knowledge of security assurance tools and Azure native cloud; ARM, Aqua, Azure Monitor, Azure AppInsights, Azure DevOps, etc. and their integration into the CI/CD cycle
  • Hands-on experience and extensive knowledge of AKS (Azure Kubernetes Service)
  • Infrastructure as code (IAC) patterns and principles
  • Azure CLI (Command Line Interface), Bash and PowerShell scripting capabilities
  • Growth mindset and automation experience with PowerShell
  • Strong debugging, testing / validation, and analytics skills
  • Good understanding of REST (representational state transfer) API (Applications and Programming Interfaces) principles
  • Understanding of Microservice architecture
  • In-depth Knowledge of Helm / Service Mesh / Kubernetes and Docker containerization
  • Good understanding of Azure Container Registry
  • In-depth knowledge of both Linux and Windows
  • Knowledge of service and messaging bus (RabbitMQ)
  • Experience with failover/recovery scenarios in MS Azure
  • Experience optimizing applications, both stand-alone and in distributed systems to maximize performance
  • Experience maintaining an infrastructure and ensuring stability and reliability while adding new features and capabilities
  • Ability to clearly articulate design and implementation choices

Opis oferty 

Site Reliability Engineer - Aviation Industry

Implement SRE within the Leasing Portfolio. And be responsible for improving and maturing the development cycles in integration/continuous deployment mode. You would partner with software engineering product teams by suggesting scalable ways to optimize code, sharing expertise and insights drawn from working across related services or products, and participating in incident response throughout development and operations lifecycles.

Leverage end-to-end technical expertise in large-scale, distributed systems' infrastructure, code, inter- and intra-service dependencies, and operations to improve the reliability, performance, efficiency, latency, and scalability of services proactively.

  • Develop code, scripts, systems and tools that reduce operational burden by automating complex and repetitive tasks, enables engineering teams to increase the velocity at which they can safely deploy changes to production, and monitors the effects of changes across systems, services, and products. 

  • Analyze telemetry data to develop capacity planning models, identify patterns, and trends that drive continuous improvement, and highlight opportunities to deploy automation to monitor and manage services and/or products.

  • Participate in on-call rotations to resolve live-site incidents, minimize customer impact, and document solutions and insights that inform ongoing improvements to infrastructure, code, tools, and/or processes that prevent the recurrence of similar issues.

Zakres obowiązków

  1. Demonstrates end-to-end expertise in distributed systems design, interactions between cloud technology layers and components, functions of physical network devices, and dependencies at scale. 
  2. Develops end-to-end technical expertise in the architecture, code, features, and operations of specific products as required to implement improvements in product availability, reliability, efficiency, observability, and/or performance. 
  3. Drives code/design reviews with the engineering teams that develop and/or manage those products and shares learnings and recommendations across engineering teams working on related products within their organization. 
  4. Researches and maintains deep knowledge of industry trends as well as advances in large-scale, distributed systems and cloud technologies; identifies opportunities to create, implement, and/or optimally utilize new tools, technologies, and/or processes to solve ambiguous problems and improve product availability, reliability, efficiency, observability, and/or performance. 
  5. Drives the adoption of innovative solutions across engineering teams working with related products within an organization and provides guidance and coaching to others on relevant topics. 
  6. Develops, tests, and implements changes to optimize code and improve the observability, reliability, and operability of platforms, systems, and products at scale. Reviews the effect of these changes to document and share development insights within their team. 
  7. Develops code, scripts, systems, or platforms that automate moderately complex but repetitive operations processes (e.g., monitoring, alerting, deploying products and updates, debugging) at scale; reviews existing automation code and scripts to evaluate reusability, extendibility, and scalability within an organization. 
  8. Analyzes data from telemetry pipelines and monitoring tools that detail operations metrics (e.g., availability, reliability, performance, efficiency) of systems, platforms, or products operating at scale.