home experience projects skills now blog github contact
Senior SDE · Open to new opportunities

Abhishek Nalla

Software Engineer @ Amazon

Building distributed systems, ranking & recommendation engines, and AI-powered products at scale.
Currently shipping video experiences for Amazon Rufus — $600M+ revenue impact.
IIIT Hyderabad · IIT Madras · Seattle, WA

5+
yrs at Amazon
45%
engagement lift
85%
latency cut
$600M+
revenue influenced
experience

Where I've worked

Dec 2023
→ Present
Amazon
Senior Software Development Engineer
Seattle, WA · Full-time
  • Designed high-throughput video experiences, driving a 45% engagement lift and $350M+ in incremental revenue — built on top of ranking and recommendation models that surface the right video to the right customer.
  • Built distributed gRPC services for low-latency moment retrieval and clip serving in Amazon's AI assistant Rufus — improved ranking and relevance of video responses to product queries, achieving 7% CTR.
  • Drove Prime Day & Black Friday peak readiness — owned service reliability, caching strategies, and autoscaling policies across multiple Tier-1 services.
Jun 2020
→ Nov 2023
Amazon
Software Development Engineer II
Hyderabad, IN · Full-time
  • Designed and scaled a Tier-1 search service handling 10% MoM traffic growth — cut latency by 85% via DynamoDB optimization and Elasticsearch migration, enabling faster and more relevant search and recommendation results.
  • Built end-to-end driver license verification and onboarding automation using SQS queues and Step Functions — reduced failure rates from 16% to 1% and eliminated 75% of manual review effort with fault-tolerant async design.
Jun 2018
→ May 2019
IIIT Hyderabad · NLP Lab
Undergraduate Researcher
Hyderabad, IN
  • Research in neural text summarization (RNN/LSTM + attention) — 23% improvement in ROUGE-L on a 500K+ document corpus.
  • Trained Transformer models for mathematical reasoning: symbolic simplification and multi-step equation transformation.
May 2019
→ Jul 2019
Zenoti
Software Engineering Intern
Hyderabad, IN
  • Built event-driven webhook infrastructure for async callbacks, reducing synchronous API polling and improving 3P client integration reliability.
projects

Things I've built

Video Widget · Amazon Detail Pagesproduction
↑ $600M+ revenue · 10M+ req/day

Built the video product widget surfaced on Amazon detail pages — a high-traffic, latency-critical feature serving 10M+ requests/day. Solved the latency challenge using lazy loading and delayed REST calls so above-the-fold content renders instantly. Ranking and recommendation signals determine which videos surface per customer, driving $600M+ in increased revenue.

JavaREST APIsRankingRecommendationsLazy LoadingLatency Optimization
Video Answers on Rufusproduction
↑ 7% CTR · $350M+ revenue influenced

Built gRPC-based clip-generation and serving workflows for Amazon's AI shopping assistant Rufus. Core focus was ranking and relevance — surfacing the most useful video moment for each product question using ranking models, vector retrieval, and multimodal signals. Achieved 7% CTR.

gRPCRankingRelevanceRecommendationsLLM / BedrockVectorsJava
Elasticsearch Migrationshipped
↓ 85% latency · Tier-1 Amazon Last Mile

Migrated DynamoDB read-heavy access paths to Elasticsearch for a Tier-1 search service. Designed shard strategy, tuned indexing and query relevance, and built ranking signals into search — cutting latency by 85% and enabling more accurate, personalized recommendations at 10% MoM traffic growth.

ElasticsearchRankingDynamoDBJavaAWSDistributed Systems
Driver Onboarding Automationshipped
Failure rate 16% → 1% · 75% manual effort cut

End-to-end async event-driven system for driver license verification using SQS queues and AWS Step Functions — eliminated manual review bottlenecks at scale.

Step FunctionsSQSLambdaJavaEvent-driven
Neural Text Summarizationresearch
+23% ROUGE-L · 500K+ document corpus

Built RNN/LSTM & attention-based models to produce high-quality abstractive summaries. Trained and evaluated on a large corpus at IIIT Hyderabad's NLP Lab.

PythonTensorFlowRNN/LSTMAttentionNLP
Math Reasoning Transformersresearch
IIIT Hyderabad · NLP Lab

Trained Transformer models to learn symbolic math rules — simplification, factoring, and multi-step equation transformation — through structured reasoning tasks.

PythonTransformersTensorFlowSymbolic Math
Zenoti Webhook Platformshipped
Async · 3P Integration Reliability

Event-driven webhook infrastructure supporting async callbacks for third-party integrations — reduced synchronous polling load and improved reliability across client APIs.

WebhooksEvent-drivenREST APIsC#
skills

What I work with

languages
Java
Python
TypeScript
C++
SQL
backend & systems
Distributed Systems Microservices gRPC REST APIs Async Processing Event-driven Queues (SQS) Multithreading
cloud & infrastructure
AWS (Bedrock, Lambda) DynamoDB Elasticsearch Redis S3 / EC2 Step Functions Autoscaling Caching
AI / agentic
RAG Ranking & Relevance Vectors MCP Agentic Orchestration RNN/LSTM Transformers TensorFlow
now

What I'm doing right now

⚡ building
Video experiences for Amazon Rufus — shipping multimodal AI responses with low-latency clip serving at scale.
Exploring agentic workflows — RAG pipelines, MCP integrations, and LLM orchestration patterns.
Personal projects — building side projects to sharpen full-stack + AI skills outside of work.
📖 learning
Deep dive into LLM internals — attention mechanisms, fine-tuning, RLHF.
System design at Staff+ level — preparing to go broader and deeper across org boundaries.
Exploring new roles — open to senior/staff SDE opportunities in AI-first companies.
📍 location
Seattle, WA — open to California, remote, or hybrid roles across the US.
Available for on-site, hybrid, or fully remote positions.
🔖 reading
Designing Data-Intensive Applications — Martin Kleppmann
The Staff Engineer's Path — Tanya Reilly
blog

Writing & thoughts

Coming soon — planning to write about distributed systems, AI/ML engineering, and lessons from building at Amazon scale.

01

How we cut search latency by 85% at Amazon

DynamoDB → Elasticsearch migration: sharding, indexing, and lessons learned

Coming soonsystems
02

Building video AI for Amazon Rufus

gRPC clip serving, moment retrieval, and ranking at 10M+ requests/day

Coming soonAI/ML
03

Event-driven automation at scale

AWS Step Functions + SQS patterns for reliable onboarding workflows

Coming soonAWS
github

Open source & activity

5+
years coding
10M+
req/day served
4
major projects
23%
ROUGE-L lift
contribution activity · update your GitHub username below to link your real graph
less
more
github profile
→ github.com/abhishek-nalla

Update the href above with your real GitHub username after deploying.

contact

Let's connect

Open to new opportunities

Senior / Staff SDE roles · AI-first companies · Remote / Seattle / California

say hello →