AI Operations Engineer
Job Type: Contract (6 months, potential to renew for 12 months)
Location: Remote (LATAM)
Compensation: in USD
At Talentus, we are looking for you!
We are a US company with a strong presence in LATAM and across 20 countries worldwide. Our key near-shore BPO services include smart-sourcing, dedicated or cluster teams, managed IT services, software outsourcing, and top ERP & CRM solutions, driven by our practices across diverse industries.
We are currently seeking an AI Operations Engineer to join one of our US clients. The AI Operations Engineer will play a crucial role in supporting implementations and managing IT infrastructure environments to ensure smooth operations. The ideal candidate will have a strong background in data science, experience with AI/ML, and familiarity with ServiceNow, particularly in the context of automation and incident management.
Responsibilities:
- Implementation & Environment Management:
Support the implementation and management of AI/ML solutions within the IT infrastructure.
Ensure operational efficiency across different environments by monitoring performance and troubleshooting issues.
- Monitoring Strategy:
Assist in defining and managing the enterprise monitoring strategy for observability and event management.
Develop a comprehensive monitoring approach covering metrics, logs, traces, alerts, dashboards, and reports.
- Automation & AI Integration:
Leverage AI and machine learning techniques for predictive analytics, incident forecasting, and automating the detection, diagnosis, and resolution of incidents.
Collaborate with IT teams to optimize monitoring processes and tools, ensuring alignment with business requirements.
- Collaboration & Communication:
Work closely with end users, production support teams, and stakeholders to gather requirements and deliver effective monitoring solutions.
Provide guidance and support to users on maximizing the benefits of monitoring capabilities.
- Tool Evaluation & Integration:
Evaluate and select appropriate monitoring tools and platforms, integrating them with existing IT systems and processes.
Assist in managing the roadmap for monitoring and automation tools, proposing innovative solutions in line with technology strategy.
- Performance Management:
Establish and maintain service level agreements (SLAs) and key performance indicators (KPIs) to monitor and report on system performance and quality.
Collaborate with IT teams to ensure best practices are followed across the enterprise.
Required Skills:
- Background in Data Science or a related field.
- Familiarity with AI/ML concepts and automation, particularly within ServiceNow.
- Experience working in IT operations or on an IT operations team.
- Strong knowledge of observability and event management best practices.
- Familiarity with multi-cloud platforms and ITIL processes, especially in event and incident management.
- Experience with monitoring tools such as SL1 and ServiceNow ITOM.
- Excellent problem-solving skills and the ability to work in a dynamic environment.
- Strong communication skills for collaboration with technical and non-technical stakeholders.
What do we offer?
- Full-time remote role
- Competitive hourly rate paid in $USD
- Opportunities for career growth and professional development
- Chance to collaborate with a global team across diverse industries and countries
- Continuous Learning and Improvement with access to Udemy courses
If you are passionate about leveraging AI and automation to enhance IT operations and are looking for a dynamic opportunity, we want to hear from you!