Skip to content
BY

Loading experience

DATA SCIENCE • DESIGN • MOTION •DATA SCIENCE • DESIGN • MOTION •DATA SCIENCE • DESIGN • MOTION •DATA SCIENCE • DESIGN • MOTION •

Data Scientist  ·  Designer  ·  Builder

Basanth Yajman

Building things that live at the intersection of data, design, and human experience.

Basanth Yajman portrait

Open to opportunities

About

Who I am

I craft digital experiences that merge deep data analysis with high-end aesthetic design. The goal is never just to make something work — it's to make it feel alive, purposeful, and uniquely yours.

3+Years building
10+Projects shipped
2Domains: ML & Design
PythonC++GoPyTorchApache KafkaAWSKubernetesNext.jsSQLApache SparkLangGraphAzureTypeScriptTerraformPythonC++GoPyTorchApache KafkaAWSKubernetesNext.jsSQLApache SparkLangGraphAzureTypeScriptTerraformPythonC++GoPyTorchApache KafkaAWSKubernetesNext.jsSQLApache SparkLangGraphAzureTypeScriptTerraform
Film Chronicle

Chronicle

Selected Experience

Present
Jan 2025 – PresentT+16M

San Jose State University (SJSU)

Applied AI Researcher / Software Engineer (GRA) | research

Architected a fault-tolerant LangGraph multi-agent pipeline with GPT-4 tool-calling for geospatial data, analyzing structural inequity in Large Language Models (LLMs) for non-English scripts. Optimized downstream causality pipelines using PyTorch and R, directly improving risk analytics precision by 15%.

"Authored comprehensive research on LLMs & engineered multi-agent risk ingestion pipelines"

PythonLangGraphLLMsPyTorchOpenCV
Aug 2025
Jun 2025 – Aug 2025T+3M

NASA Ames Research Center

Machine Learning Engineering Intern | internship

Fine-tuned NASA Prithvi Vision Transformer via MAE pretraining in PyTorch for high-risk satellite detection. Architected scalable, cloud-native REST APIs on AWS EC2 & Lambda with Kubernetes orchestration and Prometheus observability, cutting spatial inference latency by 25%.

"Fine-tuned Vision Transformers processing 2TB daily via robust AWS/Kubernetes pipelines"

AWSKubernetesPyTorchPythonSHAP
Jun 2024
Apr 2023 – Jun 2024T+15M

AtkinsRéalis

Data & Software Engineer | freelance

Developed Kafka-backed event-driven microservices on Azure Databricks with exactly-once semantics. Deployed RESTful XGBoost inference APIs via blue-green deployments, slashing infrastructure costs by $400K and reducing unplanned operational failures by 20%.

"Designed event-driven Azure tracking algorithms for 1.8M IoT signals & ML optimization"

Azure DatabricksKafkaData EngineeringXGBoostPython
Sep 2022
Sep 2021 – Sep 2022T+13M

6D Technologies

Backend / Data Operations Engineer | freelance

Built low-latency RESTful APIs in Python/C++ for production messaging architectures. Engineered stateful stream processing via Kafka, refactored PostgreSQL materialized views tuning query latency (-28%), and deployed zero-downtime microservices using GitLab CI/CD and Docker.

"Stabilized concurrent PostgreSQL databases handling 15k+ req/sec & modernized CI/CD"

PostgreSQLC++KafkaDockerCI/CD

Work

Shipped

Sequential Horizon
01F1 Prediction Engine
Data ·Python / XGBoost / Monte Carlo

F1 Prediction Engine

A triple-model ensemble (XGBoost, Monte Carlo, Bayesian) predicting the 2026 F1 era with 38.9% accuracy.

02LLM Multilingual Deficit
Data ·NLP / Tokenization / LLMs

LLM Multilingual Deficit

Quantifying the structural 'Token Tax' and economic inequality disadvantaging non-English languages in global LLMs.

03Cartograph
Data ·React / Three.js / D3.js

Cartograph

An interactive data visualization platform that transforms raw datasets into cinematic 3D narratives.

Research & Engineering

Deep dives

AI/ML

F1 Race Winner Prediction System: The Triple Ensemble Approach

Modeling the 2026 ground-effect era using XGBoost, Bayesian Inference, and lap-by-lap Monte Carlo simulations—achieving a 38.9% prediction accuracy.

Research

The Multilingual Deficit: Tokenization Inequality in LLMs

A quantitative analysis of why non-English scripts pay a 3-5x 'Token Tax' and how byte-level BPE failures creates structural disadvantage.

UX Design

Designing for Emotion: The Luminary System

How a mood-adaptive music player transformed engagement by 340% through real-time facial micro-expression analysis via TensorFlow.js.

Visual

Data as Art: Cartograph Visuals

Turning raw CSV datasets into cinematic 3D narratives. Applying Three.js and D3 to make data not just functional, but beautiful.

Writing

Transmissions

[DATA + F1]

Why F1 Data Is the Best Playground for Learning ML

Mar 15, 20257 MIN READ
[MUSIC]

The GOT Score is Statistically the Greatest TV Soundtrack

Nov 20, 20246 MIN

Library

What I'm consuming

Books read & screens watched in 2025. Hover to read my take.

The Design of Everyday Things

The Design of Everyday Things

Don Norman

Design
Sapiens

Sapiens

Yuval Noah Harari

History
The Creative Act

The Creative Act

Rick Rubin

Creative
Zero to One

Zero to One

Peter Thiel

Business
Thinking, Fast and Slow

Thinking, Fast and Slow

Daniel Kahneman

Psychology
The Almanack of Naval Ravikant

The Almanack of Naval Ravikant

Eric Jorgenson

Philosophy
Show Your Work!

Show Your Work!

Austin Kleon

Creative

Quotes

Words I live by

Vision

Thepeoplewhoarecrazyenoughtothinktheycanchangetheworldaretheoneswhodo.

Steve Jobs

01 / 05
Design

Simplicityistheultimatesophistication.

Leonardo da Vinci

02 / 05
Growth

Youdon'thavetobegreattostart,butyouhavetostarttobegreat.

Zig Ziglar

03 / 05
Agency

Thebestwaytopredictthefutureistocreateit.

Peter Drucker

04 / 05
Life

Stayhungry.Stayfoolish.

Stewart Brand

Whole Earth Catalog

05 / 05

Get in touch

Let'smakesomething

Have a project in mind? A collaboration idea? Or just want to say hi?
My inbox is always open.

Or find me on
Open to opportunities