Manju Malateshappa — AI/ML Architect

About

From hands-on engineer to AI architect — still hands-on.

I'm an AI/ML Architect at Caylent, focused on Generative AI, LLMOps, and cloud-native ML on AWS. My work spans architecture and delivery for Fintech, Automotive, MarTech, MedTech, and EdTech clients — translating fuzzy business asks into systems that ship.

I've led AI pipelines that processed 15M+ records, deployed fine-tuned open-source LLMs to production for a platform used by 25,000+ organizations, and built a GenAI document pipeline that cut financial-statement processing from 30–60 minutes to ~90 seconds. I co-authored Caylent's MLOps Solution Offering — a reusable architecture blueprint now used company-wide.

Outside delivery, I mentor engineers, contribute to open-source experiments, and write on responsible-AI topics — a thread that goes back to my graduate work at SFU, where I published on model fairness & transparency.

Experience

A career built on shipping AI to production.

May 2026 — Present Vancouver, Canada

Machine Learning Architect · Caylent

Promoted to ML Architect; lead end-to-end architecture and delivery of scalable GenAI and ML solutions on AWS — from discovery through production rollout — with focus on cost-performance trade-offs, stakeholder alignment, and maintainable systems.
Dec 2023 — May 2026 Vancouver, Canada
Senior Machine Learning Engineer · Caylent
- Led architecture, delivery, and client engagement across 6 enterprise engagements (Fintech, Automotive, MarTech, MedTech, EdTech).
- Designed GenAI systems on Amazon Bedrock: RAG pipelines, vector retrieval, agent-based architectures (ReAct, planner-executor, multi-agent).
- Established LLM evaluation frameworks (text-to-SQL, RAG quality, agent reasoning) and LLMOps practices: prompt versioning, model evaluation pipelines, observability.
- Co-authored Caylent's MLOps Solution Offering — adopted company-wide as a reference for client engagements.
- Mentored 2 engineers on GenAI architecture and MLOps to independent ownership of client deliverables.
Selected client outcomes

Fintech

97% faster financial statement processing

GenAI document pipeline: 30–60 min → ~90 s; >95% extraction accuracy; 10× throughput (50→500 docs); $0.36–0.47 per document, enabling same-day reviews.

MarTech / CRM

Fine-tuned LLMs in production for 25,000+ orgs

DistilBERT email classification (99.3% F1, 0.064s inference, $0.23/10k records) and Mistral-7B name extraction (97%+ accuracy) across beta, prod-us, prod-eu — among the first production deployments of fine-tuned Mistral-7B for enterprise NLP.

Automotive

Sole technical lead on multimodal AI avatar evaluation

Benchmarked 5 models (OmniAvatar, HunyuanVideo, Wan2.2, Wan2.2-S2V-14B, Amazon Nova Reel) across quality, generation time (2–35 min/18s video), and cost ($1.38–$2.98/video) on AWS p5.4xlarge (H100 80GB). Customer verdict: "A great success. Work of the highest quality."
Sep 2022 — Jun 2023 Vancouver, Canada
Senior AI Engineer · Alida Inc
- Led development and productization of AI services, managing a team of 4 engineers.
- Designed AI pipelines processing 15M+ records, securing a multi-year enterprise contract.
- Built reusable text analytics frameworks (PII masking, sentiment, taxonomy) on Node.js + Python; scaled on Terraform + AWS EKS.
- Managed MLOps for 13+ microservices — standardized CI/CD, testing, deployment.
- Helped grow the AI engineering team from 2 → 6 via interviews and recruiting; mentored on NLP, MLOps, cloud-native AI.
Apr 2020 — Sep 2022 Vancouver, Canada
AI/ML Engineer · Alida Inc
- Designed, deployed, and maintained 5 core NLP microservices using spaCy, NLTK, IBM NLU, AWS Translate.
- Built CI/CD with Terraform + GitHub Actions; deployed XGBoost ML pipelines to production via AWS EKS.
Jan 2014 — Aug 2019 Bangalore, India
Software Engineer · Nagra Kudelski

Grew from individual contributor to UI lead across 6+ projects serving 2M+ users (Chivas TV, Euskaltel TV, TBC Taiwan, FOXTEL Australia).
- Coordinated 4 engineers across 3 concurrent projects; led requirements, sprints, release sign-off.
- Key contributor at FOXTEL Australia: Cloud DVR, content protection, parental lock, GA-based recommendations.
- Supported live 4K content during the 2016 Rio Olympics — resolved production issues on-ground in Brazil, securing a 2-year contract extension.
- Mentored 6 engineers on UI architecture, code quality, and engineering practices.

Skills

The toolkit.

Generative AI & NLP

Amazon Bedrock
SageMaker AI
AgentCore
LLMs (Claude, ChatGPT, Mistral)
RAG
AI Agents
Prompt Engineering
Multimodal (TTS, T2V, I2V)
AWS Comprehend

MLOps & LLMOps

Model lifecycle
Evaluation frameworks
Text-to-SQL eval
RAG quality
Agent reasoning eval
Monitoring
CI/CD for ML

AI / ML Frameworks

PyTorch
Scikit-learn
MLflow
LangChain

Cloud & Infra

AWS (Lambda, Step Functions, EKS, S3, SQS, IAM)
Terraform
Docker
Kubernetes

Big Data & Orchestration

Apache Spark
Kafka
HDFS
Airflow

Web & Languages

Python
TypeScript
JavaScript
React
Node.js
Django

Databases

MySQL
PostgreSQL
Cassandra
MongoDB

CI/CD & Tooling

GitHub Actions
GoCD
Poetry
Git

Certifications

Anthropic Claude Certified Architect — Foundations
HashiCorp Certified: Terraform Associate (004)

See prep notes & approach →

Projects

Selected open-source & learning projects.

Live data from github.com/manju-malateshappa — refreshed each load.

Enable JavaScript to load live project data, or browse my repos directly on GitHub.

See all repositories on GitHub →

Achievements & Writing

A few things I'm proud of.

Hackathon · Winner

Alida Hackathon 2023

Automated PR generation and code reviews using Jira, Slack, Git diff, and LLMs via GitHub Actions.

Hackathon · Winner

Alida Hackathon 2022

XGBoost speedster-detection solution deployed on AWS SageMaker, integrated into the Alida platform.

Publication

Model Transparency & Fairness

SFU CS course project — detected and mitigated racial bias in ML classification (COMPAS dataset) with IBM AIF360, Google What-If, and SHAP. Improved Disparate Impact from 0.7 → 0.91 with XGBoost + Reweighing.

Read on Medium →

Publication · IEEE

Embedded systems research

Co-author on an IEEE paper from earlier in my career.

View on IEEE Xplore →

Education

Academic foundation.

Simon Fraser University

Burnaby, Canada · Aug 2019 – May 2021

M.Sc. Computer Science — Big Data & Machine Learning

University Visvesvaraya College of Engineering

Bangalore, India · Aug 2009 – Jun 2013

B.E. Computer Science

Contact

Let's build something together.

If you're working on Generative AI, LLMOps, or scaling AI on AWS — or just want to swap notes — I'd love to hear from you.

Email me Connect on LinkedIn Follow on GitHub

Building scalable Generative AI systems on AWS.