Jobs Career Advice Signup
X

Send this job to a friend

X

Did you notice an error or suspect this job is scam? Tell us.

  • Posted: Aug 4, 2023
    Deadline: Not specified
    • @gmail.com
    • @yahoo.com
    • @outlook.com
  • Never pay for any CBT, test or assessment as part of any recruitment process. When in doubt, contact us

    We are building an ecosystem that simplifies how businesses accept payments, make payments and manage operations. This journey started in 2016 with simplifying access to financial services using "Kudi.ai" a chatbot integration that responds to financial requests on social apps.

    It then morphed into powering a community of independent b...
    Read more about this company

     

    Site Reliability Engineer

    Job Description

    • As a Site Reliability Engineer at Nomba, you will play a crucial role in bridging the gap between software development and IT operations, with a strong focus on solving IT operations problems using software engineering.
    • You will be responsible for designing, implementing, and maintaining the infrastructure, tools, and processes required to support our development and deployment pipelines.
    • You will react in real time to production incidents and work to contain and resolve them as quickly as possible.
    • Your expertise in automation, cloud technologies, and continuous integration/continuous deployment (CI/CD) will ensure that our software is delivered efficiently, reliably, and at scale.

    About the role

    • Implement and maintain highly available, scalable, and secure production systems, emphasising automation and Infrastructure as Code (IaC) principles.
    • Collaborate with software development teams to influence the architecture and design of applications for better scalability, reliability, and performance.
    • Develop and maintain monitoring, alerting, and logging solutions to proactively detect and resolve system issues.
    • Respond to incidents and outages, conducting root cause analysis, and implementing preventative measures to minimize future occurrences.
    • Participate in on-call rotations and provide timely response to critical incidents.
    • Continuously improve system performance through performance tuning, capacity planning, and load testing.
    • Implement security best practices, ensuring that systems are compliant with industry standards and regulations.
    • Automate routine operational tasks using scripting and programming languages.
    • Work with cross-functional teams to define and document operational procedures and runbooks.
    • Contribute to the improvement of the CI/CD pipelines to ensure seamless deployments.
    • Keep abreast of industry trends, emerging technologies, and best practices in SRE and cloud infrastructure management.

    About you

    • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience).
    • Proven experience as a Site Reliability Engineer, DevOps Engineer, or a similar role managing large-scale, highly available production systems.
    • Solid experience with cloud platforms (e.g., AWS, Azure, GCP), including proficiency in provisioning and managing resources.
    • Strong understanding of Linux/Unix systems and command-line utilities.
    • Proficiency in at least one programming or scripting language (e.g., Python, Ruby, Bash, PowerShell).
    • Experience with Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
    • Knowledge of containerization and orchestration technologies (e.g., Docker, Kubernetes).
    • Familiarity with monitoring tools and concepts (e.g., Prometheus, Grafana, ELK stack).
    • Understanding of networking protocols, load balancing, and firewalls.
    • Strong problem-solving and troubleshooting skills, with a focus on root cause analysis.
    • Excellent communication and collaboration skills to work effectively with cross-functional teams.

    Nice to have

    • Relevant certifications in SRE, DevOps, or cloud technologies.
    • Experience with databases and data management (e.g., SQL, NoSQL, caching systems).
    • Knowledge of configuration management tools (e.g., Ansible, Puppet, Chef).
    • Understanding of Agile methodologies and experience in Agile/Scrum environments.
    • Familiarity with security practices and compliance frameworks.

    Method of Application

    Interested and qualified? Go to Nomba (Formerly Kudi) on nomba.talentlyft.com to apply

    Build your CV for free. Download in different templates.

  • Send your application

    View All Vacancies at Nomba (Formerly Kudi) Back To Home

Subscribe to Job Alert

 

Join our happy subscribers

 
 
Send your application through

GmailGmail YahoomailYahoomail