Skip to content
View Islam-hady9's full-sized avatar

Block or report Islam-hady9

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Islam-hady9/README.md

Hi 👋, I'm Islam Abd-Elhady

AI/ML Engineer | LLM Systems, Generative AI, and MLOps


AI/ML Engineer with 2+ years of experience designing and deploying production-grade LLM systems, RAG pipelines, and multi-agent architectures serving thousands of concurrent users.
Specialized in LLMOps, MLOps, GPU optimization, and scalable microservices on AWS, Azure, and Kubernetes.
Delivered systems that cut design cycles by 90%, reduced GPU memory usage by 40%, and sustained high-throughput inference across distributed workloads.


🔹 Core Expertise

  • 🧠 LLM Systems & Agents — RAG pipelines, multi-agent orchestration, prompt engineering, guardrails (LangChain, LangGraph, LlamaIndex)
  • ⚙️ MLOps & LLMOps — model deployment, inference optimization, CI/CD, observability, distributed tracing
  • 🎨 Generative AI — LLM fine-tuning (LoRA, PEFT, QLoRA), Stable Diffusion, SDXL, diffusion pipelines
  • 🚀 Scalable Infrastructure — FastAPI, WebSocket, Docker, Kubernetes, Redis, Celery, microservices architecture
  • 🧮 GPU Optimization — xFormers, ONNX, TensorRT, quantization, mixed-precision inference, CUDA

🚀 Current Work

  • Zedny INC (Cairo, Egypt)AI & DevOps Engineer architecting production AI platforms on microservices, delivering scalable LLM inference across distributed GPU clusters with 99.9% uptime, end-to-end MLOps/LLMOps pipelines, and Kubernetes-based autoscaling.
  • VEEM Solutions (Saudi Arabia) – Developing a large multi-agent system with RAG, tools, templates, and scalable deployment; building the next evolution of brand intelligence (Shrwd.ai).
  • Freelance — Delivered 10+ RAG systems, multi-agent architectures, and real-time voice assistants deployed on Azure, AWS, and SaladCloud.

🏆 Recent Highlights

  • Built the DataOps LLM Engine — an LLM-powered data operations engine with a 7-layer security architecture (AST validation, sandboxed execution, audit logging) enabling natural-language interaction with Excel, CSV, and pandas DataFrames.
  • Architected a large-scale multi-agent LLM system at Shrwd.ai with dynamic tool orchestration, improving answer relevance by 40% and reducing hallucinations by 50%.
  • Fine-tuned Stable Diffusion with LoRA for NFT Wear AI, compressing design cycles from 4+ hours to under 5 minutes (95% reduction).

📄 My CV

📥 Download CV (PDF)
🌐 GitHub Pages CV site


📧 Connect with me

LinkedIn Gmail Facebook GitHub


🛠️ Tech Stack

  • Languages: Python, C/C++, Java, C#, Go, Bash, SQL
  • LLMs & GenAI: LangChain, LangGraph, LlamaIndex, Hugging Face Transformers, Diffusers, LiteLLM, OpenAI, Anthropic, Google Gemini
  • ML & DL: PyTorch, TensorFlow, scikit-learn, Stable Diffusion, SDXL, CLIP, DINOv2, SAM, faster-whisper
  • Vector DBs & Retrieval: FAISS, Qdrant, Chroma, Pinecone, hybrid retrieval, re-ranking
  • MLOps & LLMOps: CI/CD, model versioning, observability, xFormers, ONNX, TensorRT, quantization
  • Cloud & Infra: AWS (EC2 g5.x GPU, S3, IAM), Azure (VMs, Container Apps), SaladCloud, Docker, Kubernetes, Helm, GitHub Actions
  • Backend: FastAPI, WebSocket, REST/Async APIs, Redis, Celery, microservices, distributed systems

💻 Languages & Tools

Python C++ C Java C# Go Bash SQL

🤖 AI / ML / LLM

PyTorch TensorFlow Keras scikit-learn Hugging Face LangChain LangGraph LlamaIndex OpenAI Anthropic Gemini FAISS OpenCV ONNX TensorRT

⚙️ Backend & Infrastructure

FastAPI Flask Redis Docker Kubernetes AWS Azure SaladCloud GitHub Actions Git Linux MySQL


📊 GitHub Stats

GitHub Streak

Profile Summary

Repos per Language Most Commit Language
Stats Productive Time

Contribution Graph

Profile views Followers

Pinned Loading

  1. dataops-llm-engine dataops-llm-engine Public

    DataOps LLM Engine

    Python

  2. deep-cuda deep-cuda Public

    Image Classification with CNN in CUDA C++

    Jupyter Notebook

  3. application-to-help-the-blind-eyes-mate application-to-help-the-blind-eyes-mate Public

    EyesMate is a graduation project designed to assist visually impaired individuals using advanced machine learning technologies.

    Jupyter Notebook 1 3

  4. SmartAI_CustomerSupport SmartAI_CustomerSupport Public

    Smart Customer Support Assistant

    Jupyter Notebook 1

  5. Generative-AI-Models Generative-AI-Models Public

    Generative AI Models is a comprehensive repository dedicated to the implementation of cutting-edge generative AI models using Python. It features various models, including those for image captionin…

    Jupyter Notebook 2

  6. BrainTumorDetection-API BrainTumorDetection-API Public

    Brain Tumor Detection using CNN & FastAPI.

    Jupyter Notebook 2