Senior Software Engineer at
Oscilar
2023
-
2026
AI Risk Decisioning Platform — real-time fraud, credit, and identity decisioning powered by LLMs, ML models, and rules orchestration for fintech and banking customers.
AI / LLM Engineering
• Owned end-to-end design of the LLM-powered risk decisioning engine, combining RAG, embeddings, and rules orchestration to deliver sub-100ms AI-assisted fraud and credit decisions in production.
• Engineered hybrid retrieval (BM25 + dense vector embeddings) with cross-encoder reranking over policy documents, transaction history, and customer profiles, improving decision relevance by 40% measured via RAGAS.
• Built LLM-driven case investigation workflows that automate fraud analyst tasks (entity resolution, narrative generation, evidence gathering) previously taking 30+ minutes per case down to under 2 minutes.
• Implemented semantic chunking and context optimization on policy and transaction corpora, reducing LLM token usage by 35% while maintaining 95%+ decision accuracy.
• Deployed quantized LLM inference using vLLM and Triton Inference Server, cutting GPU costs by 60% while serving 1K+ decisions/sec at p99 latency under 200ms.
• Designed multi-agent LangChain workflows with stateful memory and tool use for risk reasoning, calling internal services for KYC checks, device fingerprints, and behavioral signals.
• Built LLM evaluation pipelines with hallucination detection, groundedness scoring, and prompt versioning across 50K+ daily decisions, integrated into CI for prompt regressions.
• Implemented function calling and structured outputs with OpenAI and Anthropic Claude APIs to produce deterministic decision schemas consumable by downstream rules engines.
• Productionized PyTorch, XGBoost, and scikit-learn models for fraud scoring with MLflow versioning and A/B testing of 10+ model variants with automated rollback.
• Built real-time feature engineering pipelines on Redis Streams delivering sub-second feature freshness for ML inputs across velocity, geo, and device features.
Backend & Distributed Systems
• Architected event-driven decisioning microservices on Kafka with CQRS and exactly-once delivery, processing 10K+ risk events/sec with 99.9%+ uptime.
• Built low-latency decision APIs using FastAPI, Spring Boot, and Node.js, powering customer-facing risk endpoints integrated by fintech partners.
• Implemented circuit breakers and bulkhead isolation across model-serving and rules services, reducing cascading failures by 85% during dependency outages.
• Optimized PostgreSQL queries and Redis caching for case data and feature lookups, reducing retrieval latency from 250ms to 150ms with read replicas and connection pooling.
• Designed gRPC-based internal service mesh between rules engine, model server, and feature store with OpenTelemetry distributed tracing across the decision path.
Frontend & Real-Time UI
• Built React + Next.js analyst console with TypeScript and WebSocket streaming, rendering live decision streams and 10K+ alerts per minute with sub-50ms UI updates.
• Reduced Largest Contentful Paint from 3.2s to 1.1s and improved Lighthouse score from 62 to 94 through code splitting, SSR/ISR, and image optimization on the case review dashboard.
• Engineered virtualized data tables (TanStack Table) rendering 100K+ transactions at smooth 60fps for analyst review of high-volume fraud cases.
• Built reusable component library with Storybook covering 80+ UI primitives shared across analyst, admin, and customer-facing applications, on shadcn/ui and Radix.
• Achieved 90%+ frontend test coverage using Jest, React Testing Library, and Playwright E2E tests integrated into CI.
• Implemented WCAG 2.1 AA accessibility across analyst dashboards with screen reader support and keyboard navigation.
Platform & DevOps
• Deployed multi-region AWS infrastructure on EKS with Helm and ArgoCD, sustaining 99.9%+ availability for customer decision traffic.
• Built CI/CD pipelines reducing deployment cycle from ~30 minutes to under 10 minutes per release, with canary rollouts and automated rollback on SLO breaches.