Oxygen X Finance Company Limited Plc is a credit company backed by one of Africa’s largest banking groups, with the goal of building Africa’s largest consumer and SME lender. We aim to build the most accessible, convenient, digital credit solutions on the continent. We take our mission and our users’ challenges extremely seriously and are seeking brilliant,...
We are seeking an experienced Site Reliability Manager with 7+ years of hands-on experience in maintaining, optimizing, and scaling infrastructure, ensuring reliability and performance for mission-critical systems. You will collaborate closely with development, operations, and security teams to automate processes, monitor systems, and resolve operational issues while improving the overall user experience.
In this role, you will
Maintain the availability and performance of critical infrastructure.
Implement monitoring, logging, and alerting systems.
Ensure high uptime and fast response times across services.
Develop and maintain infrastructure automation tools.
Drive Infrastructure as Code (IaC) practices (Terraform, Ansible, etc.).
Implement CI/CD pipelines and automated testing.
Act as the escalation point for critical system incidents.
Lead incident response, recovery, and root cause analysis.
Conduct post-mortems and implement long-term solutions.
Design scalable systems to grow with demand.
Optimize systems for cost efficiency and performance improvements.
Ensure security best practices and compliance with industry standards.
Manage vulnerability assessments and data protection.
Work closely with development and product teams to align goals.
Provide mentorship to junior engineers and promote knowledge sharing.
We are looking for people with
7+ years of experience in site reliability, DevOps, or related engineering roles.
Experience with security practices and compliance (SOC2, GDPR).
Hands-on experience with CI/CD tools like Jenkins, GitLab CI/CD, or GitHub Actions.
Familiarity with containerization and orchestration tools like Docker and Kubernetes.
Experience with cloud platforms such as AWS and Azure, and with serverless computing.
Understanding of networking concepts and protocols (TCP/IP, DNS, HTTP, SSL/TLS).
Proficiency in Linux/Unix system administration and troubleshooting.
Familiarity with monitoring and logging tools like Prometheus, ELK stack (Elasticsearch, Logstash, Kibana), Grafana or Splunk.
Infrastructure as Code (IaC) proficiency in Terraform, CloudFormation, or Ansible.
Strong analytical and problem-solving skills with the ability to troubleshoot complex issues in distributed systems.
Excellent communication and collaboration skills to work effectively in a cross-functional team environment.
Database management experience with MySQL, PostgreSQL, MongoDB, etc.
Ability to work in fast-paced environments and handle multiple projects.