Wei Wang

hi@wangwei.dev
+1 (425) 417-8117
Full Stack AI Engineer
10YOE
Redmond, WA
weiwio
shadowwalker
wangwei.dev

Summary

Senior full-stack software engineer with 10+ years of experience building large-scale cloud platforms, developer tools, and production AI systems at Amazon, Microsoft, and Roblox. Passionate about AI and Agentic AI, with hands-on experience building LLM inference infrastructure, agent workflows, and AI-powered developer tools. Open-source contributor and founder of widely adopted projects, combining deep systems thinking with product-driven execution.

Experience

Amazon

April 2025 - Now

Software Engineer II, AI Infrastructure

Bellevue, WA, USA
  • Architected Kubernetes-based LLM inference infrastructure (vLLM, SGLang) using Helm charts and LeaderWorkerSet to reliably serve 100B+ parameter models in production, reducing NVMe model-load latency by 40% and improving startup stability at scale.
  • Built an HPC capacity management portal (Next.js/React) with automated AWS resource validation (VPC/ENI), eliminating manual checks and preventing failed scale-up events.
  • Led early adoption of Agentic AI by building MCP auto-discovery tooling and custom agents for Amazon Kiro CLI, enabling faster developer workflows and accelerating AI-native infrastructure adoption across teams.
  • Mentored engineers on the team to deliver a cloud-native observability tool for job failure visualization that significantly reduced infrastructure troubleshooting time.

Amazon

Jul 2022 - March 2025

Software Engineer II, Last Mile Transportation Technology

Bellevue, WA, USA
  • Engineered a next-gen ML-based capacity planning system (UI/API/ML orchestration) and expanded it to the US, MX, and CA markets. Reduced 52-week cost forecast errors to <20% and cut planner override rates to <20%, directly supporting long-term delivery cost reduction.
  • Designed and implemented a delivery capacity check parallelization algorithm that decreased service latency by 50% in the checkout flow, significantly improving system throughput and contributing to Amazon's ultra-fast delivery goal.
  • Modernized the Australian driver onboarding funnel by automating critical vehicle data collection, achieving feature parity with US/UK markets. Eliminated manual survey workflows to save operational hours and improved data accuracy for precise, target-based recruiting.
  • Supported maintenance and on-call incident response for global services and mentored 2 engineers to drive engineering best practices and operational stability.

Roblox

Feb 2021 - May 2022

Senior Software Engineer, Engineering Efficiency

San Mateo, CA, USA
  • Developed tools to aggregate test results from GitHub Actions workflows with .NET 5, TypeScript, Next.js, GraphQL, and a NATS message queue.
  • Improved and supported company-wide CI/CD workflows on TeamCity and GitHub Actions with the HashiCorp stack (Terraform, Nomad, Vault, Consul), JFrog Artifactory, and other tools.
  • Administered and supported third-party engineering tools for the whole company and developed services to collect engineering efficiency metrics for each team.

Microsoft

Jul 2019 - Feb 2021

Software Engineer II, Windows Notification Services

Redmond, WA, USA
  • Built and maintained backend services connecting billions of Windows platform devices and serving half a million RPS for push notifications through live TCP connections.
  • Refactored and modernized tech stacks using REST and gRPC, and built internal development tools and CI/CD pipelines to improve service performance and reliability.
  • Built a service and internal test suites supporting the web push notification feature of the new Edge browser.

Microsoft

Jun 2018 - Jul 2019

Software Engineer, CSD CFE Toolkit

Redmond, WA, USA
  • Maintained packaging tools for enterprise Windows customers. Built the team's first CI/CD pipelines.

Amazon Web Services

Feb 2017 - Jun 2018

SDE I, AWS CodePipeline

Seattle, WA, USA
  • Built CI/CD services and tools and supported region expansion. Reduced ticket resolution time from 2 weeks to 2 days.

Projects

Shortcuts AI, Founder & Builder, getshortcuts.ai, 500+ users

May 2024 - Now
  • Founded and built the first AI Agent product from 0 to 1 across Apple devices, preceding Apple Intelligence. Integrated Siri and the Shortcuts app with leading LLMs (e.g., Claude, Gemini, ChatGPT, DeepSeek, Grok) to enable users to get answers and get things done faster than ever.
  • Prototyped and launched the product in 2 months and grew users from 0 to 500+ through product-led growth and community-driven marketing. Executed marketing posts, videos, and user outreach on X, Reddit, LinkedIn, and Xiaohongshu. Researched use cases and gathered feedback for feature development and product improvement.

Next PWA, Open Source Project, 4.1K GitHub stars, 34.8M downloads

March 2019 - Jan 2024
  • Built and maintained a zero-configuration Progressive Web App (PWA) plugin for Next.js. Leveraged Workbox, service workers, and Webpack to enable offline support, optimized caching, and seamless PWA adoption for modern web apps.
  • Drove open-source adoption and collaboration, growing the project to 4,000+ GitHub stars with contributions from dozens of developers by prioritizing clear documentation, examples, and backward-compatible upgrades.

Education

Johns Hopkins University

Aug 2015 - Dec 2016

Master of Science in Engineering (M.S.E.) in Computer Science

Baltimore, Maryland, USA

Shanghai University

Aug 2011 - Jul 2015

Bachelor of Science (B.S.) in Telecommunications Engineering

Shanghai, China

Internships

  • Amazon, Last Mile Transportation Technology, May 2016 - Aug 2016
  • SAP Labs China, Global Technology Legal Compliance, Sep 2014 - Mar 2015
  • Shanghai Limei Advertising Co., Ltd, Product & Technology Department, Jul 2014 - Aug 2014

Technical Skills

Agentic AI, AWS, Azure, GCP, Kubernetes, Docker, Microservices, TypeScript, JavaScript, Java, C#, Python, React.js, Next.js, Tailwind CSS, tRPC, Drizzle, PostgreSQL, Stripe, Model Context Protocol (MCP), Agentic Workflows, LLM Inference Optimization, vLLM, SGLang, MLOps