Hi, I'm Yazan.
I'm an AIOps Engineer. I love applying AI Agent Engineering with MLOps to transform infrastructure, turning manual operations into intelligent, self-healing systems powered by SRE and DevOps best practices.
Over the past several years, I've had the privilege of working with amazing teams. I specialize in AI-driven automation, MLOps pipelines, intelligent observability, and building AI-powered infrastructure platforms.
Beyond the code, I am an active contributor to open-source software, spending the last 4+ years sharing what I know as a Technical Writer for the Fedora Project. I genuinely enjoy the process of learning and helping other developers scale their projects seamlessly.
Professional Background
Senior DevOps Engineer
Governata
Saudi Arabia · Remote
Unified platform to streamline and elevate all data governance operations. Core skills include Terraform, Amazon Web Services (AWS), and others.
Senior Site Reliability Engineer
Amman, Jordan · Remote
Leveraged AWS, Git, and enterprise cloud services to deliver infrastructure solutions, focusing on enhancing automation, seamless releases, and reliable infrastructure scale.
Senior Site Reliability Engineer
Amman, Jordan · On-site
- Release Management: Led end-to-end release processes, coordinating with cross-functional teams to ensure smooth and timely deployments with minimal downtime.
- CI/CD & Automation: Architected robust pipelines using Jenkins, Ansible, Python, and Bash to automate testing, building, and deployment stages.
- Incident Response: Acted as a point of contact for incident response, rapidly diagnosing and resolving production issues to ensure seamless user experiences.
- Observability & ChatOps: Pioneered monitoring with Zabbix and ELK Stack for MongoDB. Built ChatOps workflows using AWS Chatbot, SNS, CloudWatch, and Slack for real-time alerting.
Site Reliability Engineer
Amman, Jordan · Hybrid
- Automation Infrastructure: Spearheaded a Jenkins automation server to streamline backup and restore services in production, increasing data reliability.
- Operational Optimization: Designed automated solutions for routine operational tasks to reduce human error and boost system performance.
- Multi-Cloud Management: Managed public cloud platforms including AWS, OVH, DigitalOcean, and Platform.sh to ensure high availability and scalability.
- Incident Management: Proactively led resolution of production incidents and handled on-call rotations to maintain system integrity.
DevOps Engineer
Amman, Jordan · On-site
- Built and configured CI/CD systems to streamline deployment workflows.
- Utilized Ansible for production system and service configuration management.
- Established a comprehensive monitoring system using Nagios.
- Managed cloud infrastructure across DigitalOcean and AWS.
Extracurricular Activities
Technical Writer
Publishing deep-dive technical guides on container technologies like Podman and systemic administration, reaching a global audience of developers.
Elsewhere
Let's work together
Have a project in mind? Let's discuss how I can help with AIOps, AI Agent Engineering, MLOps, or cloud infrastructure.