Swifta Systems and Services is a professional services company focused on technology service delivery and business change management.
Read more about this company
Seeking self-motivated, highly experienced, and dedicated team players who are passionate about building digital and intuitive infrastructure.
The Enterprise Observability Engineer will be working with an infrastructure modernization team to build and operate Enterprise Observability Platform as key building block AI driven IT operation (AIOps).
Responsibilities
Drive the implementation and management of Enterprise Observability and AIOps platform by enabling predictive and self-healing capabilities for IT operation.
Plan, design, build and manage Enterprise Observability and AIOps platform based on the latest innovation to cover services running multi-cloud environment.
Collaborate and work with the various stakeholder to develop technical solutions, plans and configuration for the observability capability and develop run book automation with the objective to achieve self-healing capability.
Perform integration and develop full interoperability capabilities with various operations management platform including change management, service management, privileged access management system, etc.
Provide support to existing monitoring system and bridge the transition to an AI-enabled observability platform.
Qualifications Requirements
Possesses at least a Bachelor’s Degree in Computer Science/Information Technology or equivalent
Solid experience on managing large-scale infrastructure services on private and public cloud
Minimum 5 years hands-on-experience on designing and building Enterprise Observability and AIOps platform.
At least 5 years of hands-on working experience in: Enterprise monitoring systems (such as AppDynamic, Nagios, Thousand Eyes, Solarwind, Broadcom, Prognosis), Enterprise logging platform (such as ElasticSearch, Splunk), AIOps platform (such as Moogsoft, Microfocus Operation Bridge, Sciencelogic, Watson AIOps), Automation scripting (such as Ansible, Terraform, Powershell, etc.)
Good understanding of cloud monitoring tools on AWS, Azure and GCP
Demonstrate strong knowledge in both application and infrastructure domain with ability to develop automation script
Possess good technical knowledge in implementing, troubleshoot, performance tuning of hardware, operating system and system services.
Strong analytical skill, creative and thinking out of the box in problem solving
Excellent command of written and spoken English.
A good team player and able to work effectively at all levels of an organization with the ability to influence others to move towards consensus.
Proven ability to operate under pressure and meet challenging deadlines with minimum supervision