You’ll know A1 Bulgaria is the right place for you if you are driven by:

  • Opportunities to learn and build your career;
  • Meaningful work in a stable and fast-paced company;
  • Diversity of people, projects, and platforms;
  • A supportive, fun, and inspiring place to work.

For our team we are looking for:

Cloud Service Reliability Lead

We are seeking a highly skilled and motivated Cloud Service Reliability Lead to join our dynamic team. As the Cloud Service Reliability Lead, you will play a crucial role in maintaining the reliability and performance of the CDC (Cloud Delivery Center) infrastructure, ensuring seamless operations for our clients. You will be responsible for overseeing various aspects of infrastructure management, automation, incident response, and security compliance.

Your daily routine would include:

  • Maintaining Infrastructure Reliability to ensure the continuous and optimal performance of the CDC infrastructure;
  • Providing expert guidance and support for migration projects, using existing templates and proven methodologies;
  • Developing and implementing automation solutions to streamline operational tasks, enhance efficiency and reduce manual intervention in routine processes;
  • Handling and optimizing the demand process for cloud resources while aligning resource provisioning with business requirements and the cloud strategy;
  • Leading incident management efforts, ensuring rapid resolution and minimal service disruption while implementing self-healing mechanisms to proactively address potential issues;
  • Establishing comprehensive monitoring and alerting systems for infrastructure components;
  • Collaborating with the security team to enforce security policies and standards while ensuring infrastructure compliance with industry regulations and best practices;
  • Monitoring resource utilization, planning for capacity scaling as needed and optimizing resource allocation for cost-efficiency;
  • Maintaining and reporting on service-level agreements (SLAs) related to reliability, response times, and costs while continuously monitor and improve SLA adherence.

We’ll know you can make it if you have:

  • Substantial background in an IT management or leadership role;
  • Proven experience in cloud infrastructure management and reliability engineering such as designing, maintaining and deploying;
  • A deep understanding of cloud platforms such as Exoscale, Azure, etc. and comprehensive knowledge of their features;
  • Expertise in automation processes and proficiency in various scripting languages such as Python, PowerShell or Bash;
  • Experience with incident management, monitoring, and alerting tools.
  • Proven track record of accurately estimating and provisioning cloud resources to meet demands without over-provisioning;
  • Excellent communication and leadership skills;
  • Fluency in English language (both written and spoken).

Our gratitude for the job done will be eternal, but we’ll also offer you:

  • Innovative technologies and platforms to “play” with;
  • Modern working environment for your comfort;
  • Friendly, ambitious, and motivated teammates to support each other;
  • Thousands of online and in-person learning opportunities to grow;
  • Challenging assignments and career development opportunities in multinational environment;
  • Attractive remuneration package;
  • Flexible working schedule and opportunity for home office;
  • Numerous additional goodies, including, but not limited to free A1 services, discounts, health insurance and services, sports center, childcare, team and family events, etc.

Not sure yet? See us in action in our A1 Blog.

Any questions? Contact Ivan – Asen Koritarov

Sounds good? Apply now!

PS It’s all very real, we’ve got all kinds of awards as well to prove it.

Deadline for applications: 3 January 2024

Only shortlisted candidates will be contacted.

Apply

Tags: