MAIN RESPONSIBILITIES
• Design, set up, and manage infrastructure on cloud platforms (AWS, GCP, Azure) or on-premise environments.
• Develop and maintain CI/CD pipelines, integrating automated testing and deployment for software products and AI models.
• Collaborate with development teams to package applications (Docker), manage containers (Kubernetes), and monitor system resources.
• Monitor, log, and alert on system performance, stability, and security.
• Automate operational tasks, optimize infrastructure costs, and ensure uptime for critical systems.
• Support the deployment of AI services, big data processing, or APIs for machine learning models in production environments.
• Create technical documentation and operational guides, assisting in troubleshooting.
• Ensure compliance with security standards and internal information system regulations
JOB REQUIREMENTS
• Bachelor’s degree in Information Technology, Computer Engineering, Information Systems, or equivalent.
• Minimum of 3 years of experience as a DevOps Engineer, Site Reliability Engineer, or System Engineer.
• Proficient in CI/CD tools such as Jenkins, GitLab CI, GitHub Actions, ArgoCD, or equivalent.
• Experienced in deploying and operating systems on cloud platforms (AWS, GCP, Azure) and container technologies (Docker, Kubernetes).
• Proficient in shell scripting and at least one programming/automation language such as Python, Bash, or Go.
• Knowledge of system monitoring configurations (Prometheus, Grafana, ELK/EFK stack, CloudWatch, etc.).
• Preference for candidates with experience supporting big data systems, AI, or ML model processing pipelines.
• Understanding of system security, IAM, SSL management, firewalls, data protection, and service security
Soft Skills:
• Systematic thinking and efficient problem-solving skills.
• Strong communication skills for collaboration with development and operations teams.
• Meticulous and highly responsible, especially with production systems.
• Continuous learning mindset and ability to stay updated with new technologies.
• Ability to work independently and adapt flexibly to various infrastructure types
BENEFITS & WELFARE
• Competitive salary based on skills and experience.
• Performance-based bonuses and recognition for contributions.
• Advanced technical environment with opportunities to work on large-scale data systems and modern AI solutions.
• Transparent and professional work culture, supporting long-term career development