What's Josh Ambati's tech stack?

React, Next.js, TypeScript, Python, Java, Spring Boot, Node.js on the application side. For AI — LLM orchestration, multi-model routing, prompt engineering, and inference optimization. Infrastructure includes AWS, Docker, Kubernetes, PostgreSQL, MongoDB, and Redis.

Is Josh Ambati open to relocation?

Currently based in Chicago, IL and open to opportunities there as well as remote roles. For the right opportunity, absolutely open to relocation.

What kind of roles is Josh Ambati looking for?

Full Stack Developer, AI Engineer, Software Engineer, MLOps, or Backend Engineer roles where he can build production systems and have real ownership over what he ships.

Inkrant is an AI product company built from scratch by Josh Ambati. The flagship product — Schools by Inkrant — is an AI-native operating system for schools, handling admissions, academics, attendance, payments, and communication with 15+ AI-powered features.

Does Josh Ambati have experience with AI/ML in production?

Yes. He has built and shipped multi-model inference systems, dynamic model routing (LLaMA, GPT-4o, Gemini Flash, Mixtral), memory-augmented generation, voice AI agents, and prompt pipelines — all running in production serving real users.

What sets Josh Ambati apart from other candidates?

He built an entire product company from zero — database design to deployment, AI architecture to user-facing features. He doesn't just write code, he builds systems. He thinks like an owner, ships fast, and cares deeply about the end result.

All Posts

AI Systems10 min read

Building Inkrant: AI-Native Education from Zero to Production

March 15, 2026

Every product starts with a frustration. For me, it was watching schools drown in data they couldn't act on.

Student performance records, attendance patterns, behavioral flags — all sitting in spreadsheets and legacy systems, disconnected from the decisions that matter. Teachers were making promotion decisions based on gut feel and incomplete information.

So I started building Inkrant.

The core insight was that education AI doesn't need one massive model. It needs the right model for the right task. Small queries — attendance lookups, quick summaries — go to LLaMA 3.1 8B. It's fast and cheap. Complex reasoning — performance prediction, promotion decisions — routes to LLaMA 3.3 70B. Dynamic routing based on task complexity.

This dual-model architecture reduced our inference costs by 70% compared to routing everything through a large model. But cost wasn't the real win.

The real win was Memory-Augmented Generation. Unlike vanilla RAG that retrieves context per-query, our system builds persistent memory — learned patterns, time-series insights, contextual understanding that compounds over time. The AI doesn't just answer questions. It develops an understanding of each student.

We built 4 core AI services powering 15+ features: student performance prediction, attendance forecasting, promotion decision systems, and more. Each service uses role-based prompt pipelines with context injection and temperature tuning optimized for the specific task.

The infrastructure is provider-agnostic — we can switch between OpenAI, Together AI, and Gemini without code changes. This flexibility has been critical as the LLM landscape evolves.

Inkrant is now a platform I'm proud of. Not because it's perfect, but because it solves a real problem for real schools with real students.

All posts