$ ./init_portfolio.sh
 Kernel loaded
 Auth: narasimha.royal
 Mounting modules...
 Data Pipelines — online
 SQL Engine — online
 Dashboards — online

Hello, I'm

Narasimha Royal

Resume

About Me

Master’s

Computer Science
University of Houston

Expected: 2026

🎓 Bachelor's

Computer Science & Engineering
New Horizon College of Engineering

I have 2+ years of experience building data pipelines, ETL systems, and ingestion workflows on datasets ranging from 500K to 44M+ records. At Zensar Technologies and the University of Houston, I designed automated batch pipelines that cut prep time by 87%, built SQL-driven data models on databases with 500K+ rows, and maintained log ingestion pipelines serving 600+ daily users at 90%+ uptime.

Core stack: Python, SQL, Apache Airflow, PostgreSQL, and dbt, plus the Medallion Architecture pattern. Recent work includes an end-to-end NLP pipeline over 44.2M Amazon reviews, orchestrated with Airflow running in Docker, and a semiconductor supply chain intelligence platform that processes multi-source bill-of-materials (BOM) data.

Open to full-time Data Engineer roles starting May 2026.

Experience

Current

Instructional Assistant — Data Systems

University of Houston

Aug 2025 – May 2026
1,000+ records | Schema Design | ETL & Reporting
  • Designed and maintained backend relational database schemas for tracking 1,000+ student performance records, writing migration scripts and enforcing data integrity constraints across semester transitions.
  • Built source-to-target mapping documents and ETL pipelines to consolidate multi-source operational data for a non-profit client (ESCH), delivering dashboards that visualize KPIs (completion rates, grade distributions) for stakeholders.

Student Assistant — Financial Data & Reporting

University of Houston

Apr 2025 – Aug 2025
ETL (Smartsheet) | SQL Modeling | Budget Reporting
  • Built an ETL pipeline extracting financial data from Smartsheet, transforming and loading it into a structured schema, then surfacing budget allocations, vendor payments, and delivery bottlenecks in Power BI dashboards.
  • Wrote SQL queries (CTEs, multi-table JOINs) to consolidate fragmented invoice data across departmental systems into a unified reporting layer, eliminating manual reconciliation and reducing budget discrepancies.

Graduate Assistant — Systems & Data Operations

University of Houston

Dec 2024 – Apr 2025
600+ users | Python & SQL | 90%+ uptime
  • Built a Python (Pandas) and SQL log ingestion pipeline that processes daily system usage logs for 600+ concurrent users: it parses raw log files, flags hardware failure patterns, and loads structured records into a PostgreSQL reporting table.
  • Delivered weekly pipeline health reports to department leadership tracking system responsiveness and exam load speeds, enabling proactive hardware optimization decisions that maintained 90%+ operational uptime.
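
A minimal sketch of the parse, flag, and load steps described above, not the production code. To stay self-contained it uses the standard-library `re` and `sqlite3` modules in place of Pandas and PostgreSQL; the log line format, the failure keywords, and the `usage_log` table name are all hypothetical.

```python
import re
import sqlite3

# Hypothetical log format: "<ISO timestamp> <host> <LEVEL> <message>"
LOG_LINE = re.compile(r"^(?P<ts>\S+) (?P<host>\S+) (?P<level>[A-Z]+) (?P<msg>.*)$")

# Hypothetical keywords treated as hardware-failure signals
FAILURE_KEYWORDS = ("disk error", "overheat", "memory fault")


def parse_line(line):
    """Return a structured record, or None if the line does not match."""
    match = LOG_LINE.match(line.strip())
    if match is None:
        return None
    record = match.groupdict()
    msg = record["msg"].lower()
    record["hardware_flag"] = int(any(k in msg for k in FAILURE_KEYWORDS))
    return record


def load_records(conn, lines):
    """Parse raw lines and insert the valid ones; return the number loaded."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS usage_log "
        "(ts TEXT, host TEXT, level TEXT, msg TEXT, hardware_flag INTEGER)"
    )
    records = [r for r in map(parse_line, lines) if r is not None]
    conn.executemany(
        "INSERT INTO usage_log VALUES (:ts, :host, :level, :msg, :hardware_flag)",
        records,
    )
    conn.commit()
    return len(records)


# Made-up sample lines, including one that should be rejected
sample = [
    "2025-01-15T09:00:01 lab-pc-07 INFO user login",
    "2025-01-15T09:02:44 lab-pc-12 ERROR Disk error on /dev/sda",
    "malformed line",
]
conn = sqlite3.connect(":memory:")  # stand-in for the PostgreSQL reporting table
loaded = load_records(conn, sample)
flagged = conn.execute(
    "SELECT host FROM usage_log WHERE hardware_flag = 1"
).fetchall()
```

Rejecting unparseable lines before the insert keeps the reporting table clean, and storing the flag as a column lets a weekly report count failures per host with a single query.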

Associate Data Engineer

Zensar Technologies — Bengaluru, KA

Mar 2023 – Apr 2024
500K+ rows | ETL pipelines | Data Quality
  • Designed and maintained batch ETL pipelines ingesting 500K+ delivery records, applying incremental loading, deduplication, and data quality checks before surfacing results to downstream reporting tables.
  • Built automated data quality validation scripts in Python (Pandas) with rule-based checks (null rates, referential integrity, range bounds) that eliminated recurring data errors before they reached downstream reports.
  • Wrote optimized SQL (CTEs, window functions, JOINs) to model and query the curated data layer, surfacing workflow bottlenecks and SLA breaches in interactive Power BI dashboards used in stakeholder syncs.
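
The rule-based checks named above (null rates, referential integrity, range bounds) can be sketched roughly as follows. This is an illustration under stated assumptions rather than the scripts used at Zensar: it is written in plain Python instead of Pandas so it runs without dependencies, and the column names (`delivered_at`, `customer_id`, `weight_kg`), thresholds, and sample rows are hypothetical.

```python
def null_rate(rows, column):
    """Fraction of rows where the column is missing or None."""
    if not rows:
        return 0.0
    missing = sum(1 for row in rows if row.get(column) is None)
    return missing / len(rows)


def orphaned_keys(rows, column, valid_keys):
    """Foreign-key values that do not appear in the reference set."""
    return sorted(
        {row[column] for row in rows
         if row.get(column) is not None and row[column] not in valid_keys}
    )


def out_of_range(rows, column, low, high):
    """Rows whose value falls outside [low, high]; None values are skipped."""
    return [
        row for row in rows
        if row.get(column) is not None and not (low <= row[column] <= high)
    ]


def run_checks(rows, valid_customer_ids, max_null_rate=0.05):
    """Return a list of readable failures; an empty list means the batch passes."""
    failures = []
    rate = null_rate(rows, "delivered_at")
    if rate > max_null_rate:
        failures.append(
            f"delivered_at null rate {rate:.0%} exceeds {max_null_rate:.0%}"
        )
    orphans = orphaned_keys(rows, "customer_id", valid_customer_ids)
    if orphans:
        failures.append(f"unknown customer_id values: {orphans}")
    bad = out_of_range(rows, "weight_kg", 0, 1000)
    if bad:
        failures.append(f"{len(bad)} row(s) with weight_kg outside 0-1000")
    return failures


# Made-up batch: the second row violates all three rules
rows = [
    {"delivery_id": 1, "customer_id": "C1", "weight_kg": 12.5,
     "delivered_at": "2023-06-01"},
    {"delivery_id": 2, "customer_id": "C9", "weight_kg": -3.0,
     "delivered_at": None},
]
failures = run_checks(rows, valid_customer_ids={"C1", "C2"})
```

Returning a list of failures instead of raising on the first one lets a pipeline log every broken rule for a batch before deciding whether to block the load.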

My Technical Toolkit

Data Engineering

Python
SQL
PostgreSQL
Apache Airflow
Apache Spark
Docker
Apache Kafka
dbt

Cloud & Platforms

AWS
Azure
GCP
Databricks
Snowflake
BigQuery

Data Analysis

Pandas
NumPy
Matplotlib
Seaborn
Power BI
Tableau
Excel

AI & ML

scikit-learn
Ollama
GitHub Copilot

Other Tools

Git
GitHub
FastAPI
React

Achievements

Research Paper

IoT based Smart Crosswalk System

View Publication
Stanford Code in Place

Code in Place
Stanford University

View Certificate
McKinsey Forward

Forward Program
McKinsey & Company

View Certificate
DataCamp

DataCamp Professional Certs

View Certificate

Featured Work

Semiconductor Supply Chain Intelligence

Python | Apache Kafka | Apache Airflow | PostgreSQL | dbt | Great Expectations | Docker | Medallion Architecture

Amazon Review Sentiment Analysis

Python | PostgreSQL | Apache Airflow | Docker | VADER | Power BI

NVIDIA Stock Trend Analysis

Excel | Statistical Analysis | Tableau

Let's Connect

Email

narasimharoyal31@gmail.com

Send an email

Phone

(+1) 832-721-9870

Text