Freelance Software Architect, DevOps, Cloud & AI Infrastructure Specialist | PhD in Distributed Systems
I'm a Software Architect and Platform Engineer with 8+ years of experience building production-grade AI/LLM infrastructure, cloud-native platforms, and distributed systems. I hold a PhD in Distributed Systems from Inria Labs (France).
My expertise lies at the intersection of AI infrastructure, cloud platforms, and distributed systems. I've built and scaled platforms serving 200,000+ daily users , where I architected multi-provider LLM routing platforms and cross-platform mobile applications, led engineering teams, and published research in top-tier conferences. I'm passionate about solving complex technical challenges and delivering high-impact solutions.
Currently available for freelance consulting, development and architecture projects.
- Production-grade LLM integration (OpenAI, Anthropic Claude, Grok)
- Multi-provider routing and failover strategies
- Real-time latency guardrails and cost optimization
- AI agent orchestration and workflow automation
- AWS Bedrock, API Gateway, and serverless architectures
- Multi-cloud solutions: AWS, GCP, Azure
- Kubernetes (EKS, GKE, AKS) cluster design and optimization
- Infrastructure-as-Code with Terraform and Ansible
- CI/CD pipeline design (GitHub Actions, Jenkins, Argo)
- Auto-scaling, load balancing, and high-availability systems
- Microservices architecture design
- API design (REST, gRPC, GraphQL)
- Container orchestration and Helm chart development
- Monitoring and observability (CloudWatch, ELK Stack, Prometheus)
- Database optimization (PostgreSQL, MongoDB, Redis)
- Multi-tenant ML platforms
- Model deployment pipelines
- GPU workload orchestration
- Data pipeline design (Kafka, RabbitMQ, Elasticsearch)
- Feature engineering and data processing at scale
- Architecture design and technical roadmap planning
- Team leadership and mentorship
- Code reviews and best practices implementation
- System design and scalability consulting
Technologies: AWS EKS, Python (FastAPI), Node.js, OpenAI, Claude, Grok, React Native
- Architected production-grade LLM infrastructure with intelligent request routing across multiple providers (OpenAI GPT-4, Grok-4, Claude)
- Implemented real-time latency guardrails ensuring sub-500ms response times
- Built cost optimization layer reducing LLM inference costs by 35%
- Developed cross-platform mobile apps (iOS/Android) using React Native
- Designed scalable backend microservices handling 10k+ daily API requests
- Impact: Enabled seamless AI integration for enterprise clients with 99.9% uptime
Technologies: AWS EKS, PostgreSQL, MongoDB, Redis, Python, Kubernetes
- Built and scaled video analytics platform serving 200,000+ daily users across multiple tenants
- Optimized database layer (PostgreSQL, MongoDB, Redis) achieving 40% cost reduction and sub-100ms query response times
- Implemented Kubernetes auto-scaling policies handling variable workloads (10x traffic spikes)
- Designed multi-tenant isolation strategy ensuring data security and compliance
- Led 5-person engineering team through rapid growth phase
- Impact: Delivered production platform processing 1M+ video analytics requests daily
Technologies: GKE, Terraform, Node.js, React, ELK Stack, Jenkins
- Designed and deployed MLOps platform serving 60+ data scientists across multiple teams
- Built full-stack platform with React frontend and Node.js backend
- Implemented blue-green deployment strategy achieving zero-downtime releases
- Automated infrastructure provisioning using Terraform on GCP (GKE)
- Set up monitoring with ELK Stack providing real-time observability
- Impact: Reduced model deployment time from 2 days to 2 hours; received Employee of the Year Award
Technologies: Python, Go, Kubernetes, Terraform, Ansible, gRPC
- Developed novel distributed agent framework for managing workloads across geo-distributed edge and cloud environments
- Implemented custom distributed consensus algorithm in Go extending beyond RAFT/Paxos for specific use cases
- Designed dependency graph algorithm for Kubernetes resource management
- Automated infrastructure deployment across 50+ servers using Terraform and Ansible
- Built microservices architecture with REST APIs and gRPC for inter-service communication
- Published 5 peer-reviewed research papers in IEEE, Springer, and ACM conferences
- Impact: Novel approach to distributed coordination reducing cross-region latency by 40%
Technologies: AWS (EKS, Bedrock, Lambda, RDS, S3), Azure (AKS), Python, VAPI, Amazon Transcribe
- Architecting cloud-native applications on AWS EKS with auto-scaling and high availability
- Integrating AI services: VAPI voice AI, Amazon Transcribe, ChatGPT-5, Claude APIs
- Deploying Azure Kubernetes Service (AKS) for ML workloads with GPU node pools (NVIDIA A100)
- Building Python microservices for infrastructure management with HashiCorp Vault integration
- Implementing event-driven architectures with AWS Lambda and API Gateway
- Impact: Delivered scalable AI-powered solutions reducing client operational costs by 30%
βοΈ AWS: EKS, Bedrock, API Gateway, Lambda, S3, RDS, CloudWatch, EC2, VPC
βοΈ GCP: GKE, Cloud Storage, Cloud Functions, BigQuery, Cloud SQL
βοΈ Azure: AKS, Private Networks, GPU Node Pools, Azure DevOps
π€ LLM Providers: OpenAI (GPT-4, GPT-5), Anthropic (Claude), Grok
π€ AI Services: AWS Bedrock, VAPI (Voice AI), Amazon Transcribe
π€ Frameworks: LangChain, AI agent orchestration, prompt engineering
π€ Optimization: Multi-provider routing, latency guardrails, cost management
π³ Kubernetes: EKS, GKE, AKS, Helm, Custom Operators, Auto-scaling
π³ Infrastructure-as-Code: Terraform, Ansible, CloudFormation
π³ CI/CD: GitHub Actions, Jenkins, Argo CD, Blue-Green Deployments
π³ Containers: Docker, Docker Compose, Container registries (ECR, GCR)
π» Primary: Python (FastAPI, Flask, Django), Go (microservices, CLI tools)
π» Backend: Node.js (Express, NestJS), TypeScript
π» Systems: C++ (performance-critical components), Bash scripting
π» Mobile: Swift (iOS), Kotlin (Android), React Native, Flutter
π Frameworks: FastAPI, Flask, Node.js/Express, Django
π API Styles: REST, gRPC, GraphQL, WebSockets
π Architecture: Microservices, Event-driven, Serverless
π Authentication: JWT, OAuth2, API keys, HashiCorp Vault
πΎ Relational: PostgreSQL (optimization, replication, partitioning)
πΎ NoSQL: MongoDB, CouchDB, Elasticsearch
πΎ Caching: Redis (pub/sub, caching strategies)
πΎ Message Queues: Kafka, RabbitMQ, AWS SQS
πΎ Search: Elasticsearch, OpenSearch
π Logging: ELK Stack (Elasticsearch, Logstash, Kibana), AWS CloudWatch
π Metrics: Prometheus, Grafana, CloudWatch Metrics
π Tracing: Distributed tracing, APM tools
π Alerting: PagerDuty, CloudWatch Alarms, Slack integrations
π± Cross-platform: React Native, Flutter
π± Native iOS: Swift, Xcode, App Store deployment
π± Native Android: Kotlin, Android Studio, Play Store deployment
π± Backend Integration: REST APIs, real-time data sync, push notifications
Research Focus: Distributed Systems, Cloud-Edge Computing, Consensus Algorithms
- Developed novel distributed agent framework for geo-distributed workload management
- Designed custom consensus algorithm extending beyond RAFT/Paxos for specific coordination patterns
- Published 5 peer-reviewed papers in top-tier conferences (IEEE, Springer, ACM)
- Automated large-scale infrastructure (50+ servers) for experimental validation
Specialization: Software Engineering, Big Data Analytics
- π₯ Employee of the Year Award - Reliance Industries (2020)
- π 5 Peer-Reviewed Publications - IEEE, Springer, ACM conferences and journals
- π₯ Led Engineering Teams - Managed and mentored 5+ engineers across multiple projects
- π Built 200k+ User Platforms - Delivered production systems serving hundreds of thousands of daily users
- π― 40% Cost Reduction - Optimized database infrastructure reducing operational costs significantly
- β‘ Novel Research Contributions - Developed distributed consensus algorithm and dependency graph algorithms
I'm available for freelance Developing/ Consulting on:
- AI/LLM infrastructure architecture and implementation
- Cloud platform design and migration (AWS, GCP, Azure)
- Platform engineering and DevOps transformation
- MLOps pipeline design and optimization
- Distributed systems architecture
- Technical advisory and CTO-as-a-Service
- π§ Email: [geo8.antony@gmail.com]
- πΌ LinkedIn: [https://www.linkedin.com/in/goanto/]
- π Location: France (Remote worldwide)
- π Availability: Open for projects starting immediately
"I believe in building systems that are not just scalable, but also maintainable and cost-effective. Every architectural decision should balance technical excellence with business pragmatism. Whether it's optimizing LLM costs, reducing database query times, or designing distributed consensus algorithms, the goal is always to deliver measurable impact."



