goanto/README.md

Dr. Geo Johns Antony

Freelance Software Architect, DevOps, Cloud & AI Infrastructure Specialist | PhD in Distributed Systems



👋 About Me

I'm a Software Architect and Platform Engineer with 8+ years of experience building production-grade AI/LLM infrastructure, cloud-native platforms, and distributed systems. I hold a PhD in Distributed Systems from Inria Labs (France).

My expertise lies at the intersection of AI infrastructure, cloud platforms, and distributed systems. I've built and scaled platforms serving 200,000+ daily users, architected multi-provider LLM routing platforms and cross-platform mobile applications, led engineering teams, and published research at top-tier conferences. I'm passionate about solving complex technical challenges and delivering high-impact solutions.

Currently available for freelance consulting, development, and architecture projects.


🎯 What I Offer

AI/LLM Infrastructure Consulting

  • Production-grade LLM integration (OpenAI, Anthropic Claude, Grok)
  • Multi-provider routing and failover strategies
  • Real-time latency guardrails and cost optimization
  • AI agent orchestration and workflow automation
  • AWS Bedrock, API Gateway, and serverless architectures

Cloud Platform Architecture

  • Multi-cloud solutions: AWS, GCP, Azure
  • Kubernetes (EKS, GKE, AKS) cluster design and optimization
  • Infrastructure-as-Code with Terraform and Ansible
  • CI/CD pipeline design (GitHub Actions, Jenkins, Argo)
  • Auto-scaling, load balancing, and high-availability systems

Platform Engineering & DevOps

  • Microservices architecture design
  • API design (REST, gRPC, GraphQL)
  • Container orchestration and Helm chart development
  • Monitoring and observability (CloudWatch, ELK Stack, Prometheus)
  • Database optimization (PostgreSQL, MongoDB, Redis)

MLOps & Data Infrastructure

  • Multi-tenant ML platforms
  • Model deployment pipelines
  • GPU workload orchestration
  • Data pipeline design (Kafka, RabbitMQ, Elasticsearch)
  • Feature engineering and data processing at scale

Technical Leadership

  • Architecture design and technical roadmap planning
  • Team leadership and mentorship
  • Code reviews and best practices implementation
  • System design and scalability consulting

💼 Key Project Experience

Multi-Provider LLM Routing Platform | Co-Founder & CTO at Optimitz

Technologies: AWS EKS, Python (FastAPI), Node.js, OpenAI, Claude, Grok, React Native

  • Architected production-grade LLM infrastructure with intelligent request routing across multiple providers (OpenAI GPT-4, Grok-4, Claude)
  • Implemented real-time latency guardrails ensuring sub-500ms response times
  • Built cost optimization layer reducing LLM inference costs by 35%
  • Developed cross-platform mobile apps (iOS/Android) using React Native
  • Designed scalable backend microservices handling 10k+ daily API requests
  • Impact: Enabled seamless AI integration for enterprise clients with 99.9% uptime

Multi-Tenant Video Analytics Platform | Founding Software Engineer (MLOps) at Aivid.ai

Technologies: AWS EKS, PostgreSQL, MongoDB, Redis, Python, Kubernetes

  • Built and scaled video analytics platform serving 200,000+ daily users across multiple tenants
  • Optimized database layer (PostgreSQL, MongoDB, Redis) achieving 40% cost reduction and sub-100ms query response times
  • Implemented Kubernetes auto-scaling policies handling variable workloads (10x traffic spikes)
  • Designed multi-tenant isolation strategy ensuring data security and compliance
  • Led 5-person engineering team through rapid growth phase
  • Impact: Delivered production platform processing 1M+ video analytics requests daily

MLOps Platform for Data Scientists | Software Engineer (MLOps) at Reliance Industries

Technologies: GKE, Terraform, Node.js, React, ELK Stack, Jenkins

  • Designed and deployed MLOps platform serving 60+ data scientists across multiple teams
  • Built full-stack platform with React frontend and Node.js backend
  • Implemented blue-green deployment strategy achieving zero-downtime releases
  • Automated infrastructure provisioning using Terraform on GCP (GKE)
  • Set up monitoring with ELK Stack providing real-time observability
  • Impact: Reduced model deployment time from 2 days to 2 hours; received Employee of the Year Award

Distributed Agent Framework for Geo-Distributed Systems | PhD Research at Inria Labs

Technologies: Python, Go, Kubernetes, Terraform, Ansible, gRPC

  • Developed novel distributed agent framework for managing workloads across geo-distributed edge and cloud environments
  • Implemented custom distributed consensus algorithm in Go extending beyond Raft/Paxos for specific use cases
  • Designed dependency graph algorithm for Kubernetes resource management
  • Automated infrastructure deployment across 50+ servers using Terraform and Ansible
  • Built microservices architecture with REST APIs and gRPC for inter-service communication
  • Published 5 peer-reviewed research papers in IEEE, Springer, and ACM conferences
  • Impact: Novel approach to distributed coordination reducing cross-region latency by 40%

Current Freelance Projects

Technologies: AWS (EKS, Bedrock, Lambda, RDS, S3), Azure (AKS), Python, VAPI, Amazon Transcribe

  • Architecting cloud-native applications on AWS EKS with auto-scaling and high availability
  • Integrating AI services: VAPI voice AI, Amazon Transcribe, and the GPT-5 and Claude APIs
  • Deploying Azure Kubernetes Service (AKS) for ML workloads with GPU node pools (NVIDIA A100)
  • Building Python microservices for infrastructure management with HashiCorp Vault integration
  • Implementing event-driven architectures with AWS Lambda and API Gateway
  • Impact: Delivered scalable AI-powered solutions reducing client operational costs by 30%

🔧 Technical Expertise

Cloud Platforms & Infrastructure

☁️  AWS: EKS, Bedrock, API Gateway, Lambda, S3, RDS, CloudWatch, EC2, VPC
☁️  GCP: GKE, Cloud Storage, Cloud Functions, BigQuery, Cloud SQL
☁️  Azure: AKS, Private Networks, GPU Node Pools, Azure DevOps

AI/LLM Stack

🤖  LLM Providers: OpenAI (GPT-4, GPT-5), Anthropic (Claude), Grok
🤖  AI Services: AWS Bedrock, VAPI (Voice AI), Amazon Transcribe
🤖  Frameworks: LangChain, AI agent orchestration, prompt engineering
🤖  Optimization: Multi-provider routing, latency guardrails, cost management

Container Orchestration & DevOps

🐳  Kubernetes: EKS, GKE, AKS, Helm, Custom Operators, Auto-scaling
🐳  Infrastructure-as-Code: Terraform, Ansible, CloudFormation
🐳  CI/CD: GitHub Actions, Jenkins, Argo CD, Blue-Green Deployments
🐳  Containers: Docker, Docker Compose, Container registries (ECR, GCR)

Programming Languages

💻  Primary: Python (FastAPI, Flask, Django), Go (microservices, CLI tools)
💻  Backend: Node.js (Express, NestJS), TypeScript
💻  Systems: C++ (performance-critical components), Bash scripting
💻  Mobile: Swift (iOS), Kotlin (Android), React Native, Flutter

Backend & API Design

🔌  Frameworks: FastAPI, Flask, Node.js/Express, Django
🔌  API Styles: REST, gRPC, GraphQL, WebSockets
🔌  Architecture: Microservices, Event-driven, Serverless
🔌  Authentication: JWT, OAuth2, API keys, HashiCorp Vault

Databases & Data Stores

💾  Relational: PostgreSQL (optimization, replication, partitioning)
💾  NoSQL: MongoDB, CouchDB, Elasticsearch
💾  Caching: Redis (pub/sub, caching strategies)
💾  Message Queues: Kafka, RabbitMQ, AWS SQS
💾  Search: Elasticsearch, OpenSearch

Monitoring & Observability

📊  Logging: ELK Stack (Elasticsearch, Logstash, Kibana), AWS CloudWatch
📊  Metrics: Prometheus, Grafana, CloudWatch Metrics
📊  Tracing: Distributed tracing, APM tools
📊  Alerting: PagerDuty, CloudWatch Alarms, Slack integrations

Mobile Development

📱  Cross-platform: React Native, Flutter
📱  Native iOS: Swift, Xcode, App Store deployment
📱  Native Android: Kotlin, Android Studio, Play Store deployment
📱  Backend Integration: REST APIs, real-time data sync, push notifications

🎓 Education & Research

Ph.D. in Computer Science | Inria Labs & IMT Atlantique, France (2021-2024)

Research Focus: Distributed Systems, Cloud-Edge Computing, Consensus Algorithms

  • Developed novel distributed agent framework for geo-distributed workload management
  • Designed custom consensus algorithm extending beyond Raft/Paxos for specific coordination patterns
  • Published 5 peer-reviewed papers in top-tier conferences (IEEE, Springer, ACM)
  • Automated large-scale infrastructure (50+ servers) for experimental validation

M.Tech in Computer Science | VIT, India (2018-2020)

Specialization: Software Engineering, Big Data Analytics

B.Tech in Computer Science | Christ University, India (2014-2018)


🏆 Key Achievements

  • 🥇 Employee of the Year Award - Reliance Industries (2020)
  • 📄 5 Peer-Reviewed Publications - IEEE, Springer, ACM conferences and journals
  • 👥 Led Engineering Teams - Managed and mentored 5+ engineers across multiple projects
  • 🚀 Built 200k+ User Platforms - Delivered production systems serving hundreds of thousands of daily users
  • 🎯 40% Cost Reduction - Optimized database infrastructure reducing operational costs significantly
  • ⚡ Novel Research Contributions - Developed distributed consensus algorithm and dependency graph algorithms

📫 Let's Work Together

I'm available for freelance development and consulting on:

  • AI/LLM infrastructure architecture and implementation
  • Cloud platform design and migration (AWS, GCP, Azure)
  • Platform engineering and DevOps transformation
  • MLOps pipeline design and optimization
  • Distributed systems architecture
  • Technical advisory and CTO-as-a-Service

Contact Me:


💡 Technical Philosophy

"I believe in building systems that are not just scalable, but also maintainable and cost-effective. Every architectural decision should balance technical excellence with business pragmatism. Whether it's optimizing LLM costs, reducing database query times, or designing distributed consensus algorithms, the goal is always to deliver measurable impact."


Popular repositories

  1. kubeflow_patch_seldon (Public)

  2. GPU-QuickSort-Algorithm (Public)

    Forked from khaman1/GPU-QuickSort-Algorithm

    Cuda

  3. azuredisk-csi-driver (Public)

    Forked from kubernetes-sigs/azuredisk-csi-driver

    Azure Disk Container Storage Interface (CSI) Storage Plugin

    Go

  4. kubeflow (Public)

    Forked from kubeflow/kubeflow

    Machine Learning Toolkit for Kubernetes

    Jsonnet

  5. kubeless (Public)

    Forked from vmware-archive/kubeless

    Kubernetes Native Serverless Framework

    Go

  6. seldon-core (Public)

    Forked from SeldonIO/seldon-core

    Machine Learning Deployment for Kubernetes

    Python