AI System Monitoring Specialist
SKILLS
FULL DESCRIPTION
Summary
The AI System Monitoring Specialist will analyze AI system logs, develop monitoring tools, automate issue detection, collaborate with engineers, and communicate performance reports. Required skills include Python, SQL, experience with monitoring tools like Grafana, knowledge of MLOps, and a degree in a related field. Experience with big data tools like Hadoop or Spark is a plus.
Key Responsibilities:
- Analyze AI system logs to identify anomalies.
- Develop tools for real-time AI monitoring.
- Utilize AI tools to automate issue detection and resolution.
- Collaborate with engineers to improve system stability.
- Communicate performance reports to stakeholders.
Core Requirements:
- Experience with Python, SQL, and monitoring tools like Grafana.
- Solid understanding of machine learning operations (MLOps).
- Knowledge of big data tools like Hadoop or Spark is a plus.
- Strong analytical and teamwork skills.
- Bachelor’s or Master’s in Data Science, AI, or related field.
Responsibilities:
Analyze AI system logs to identify anomalies. Develop tools for real-time AI monitoring. Utilize AI tools to automate issue detection and resolution. Collaborate with engineers to improve system stability. Communicate performance reports to stakeholders.
Requirements:
Experience with Python, SQL, and monitoring tools like Grafana. Solid understanding of machine learning operations (MLOps). Knowledge of big data tools like Hadoop or Spark is a plus. Strong analytical and teamwork skills. Bachelor’s or Master’s in Data Science, AI, or related field.