Nabeel Sheikh
( Oracle )
Senior Software Engineer, Compute Infrastructure | Oracle
As a Senior Software Engineer in Oracle’s Compute Infrastructure organization, I design, build, and operate highly scalable, reliable, and secure distributed systems that power Oracle Cloud Infrastructure (OCI). My work focuses on core compute services, including virtual machine lifecycle management, control-plane and data-plane services, and large-scale infrastructure automation. I collaborate closely with cross-functional teams to improve system resiliency, performance, and operational efficiency while supporting multi-tenant, globally distributed cloud environments. I contribute to architectural design decisions, develop production-grade services in Java and related technologies, and drive operational excellence through automation, monitoring, and on-call ownership, ensuring high availability and performance for AI/ML workloads.
Senior Data Engineer | Better.com (April 2023 – Present)
In my current role as a Senior Data Engineer at Better.com, I work across data science, analytics, and analytics engineering to enable data-driven decision-making at scale. I leverage tools such as Fivetran, dbt, Azure Synapse, Databricks, Snowflake, and Power BI to build and maintain scalable data pipelines, dashboards, and predictive models that support customer segmentation, targeted marketing, and revenue growth. I have designed and managed over 30 real-time and batch data pipelines integrating data from multiple sources into a lakehouse architecture, processing billions to trillions of data points. My work includes building ETL and ELT pipelines using Airflow, Python, Java, Azure Data Factory, and Databricks, integrating SaaS analytics tools such as Amplitude and Mixpanel, and collaborating with cross-functional teams to design and measure A/B tests. I have also implemented secure, Azure-based data warehousing and streaming solutions integrated with Oracle ERP systems, reducing operational costs by approximately 20%.
Software Engineer (Data) | Amazon Web Services (September 2021 – March 2023)
At Amazon Web Services, I built robust batch and streaming data pipelines to ingest data from internal and third-party sources into Amazon Redshift. I conducted advanced analytics on server utilization, customer acquisition, and retention, driving measurable improvements in operational efficiency and customer retention. I developed ETL workflows using AWS Glue, Lambda, Airflow, and Python, ensured data integrity between Oracle and Redshift, and created real-time dashboards in Amazon QuickSight to support strategic business decisions. Additionally, I contributed to CI/CD processes using Jenkins and partnered with stakeholders to translate business requirements into scalable analytics solutions.
Data Engineer | SnapCheck (December 2020 – August 2021)
At SnapCheck, I applied advanced analytics and statistical techniques to identify fraud patterns and improve the performance of predictive models in the digital payments space. I developed ETL solutions using Databricks, SQL, and Python to process data from multiple formats and sources, delivering actionable insights through Power BI. I collaborated closely with clients to design end-to-end data architectures on Azure, supporting cost reduction initiatives and enhancing the efficiency and security of payment systems.
Data Engineer | Brane Enterprises (June 2018 – December 2018)
During my tenure at Brane Enterprises, I focused on big data engineering and cloud infrastructure automation. I used Terraform to create reusable infrastructure components, implemented Sqoop for data ingestion between relational databases and HDFS, and developed Spark-based data processing workflows on Amazon EMR. I also leveraged AWS CloudTrail to support audit compliance, data lineage, and monitoring requirements.
Data Engineer | Examity (January 2017 – May 2018)
At Examity, I designed and implemented end-to-end data pipelines on Azure Synapse Analytics, integrating data from on-premises systems such as Oracle, SAP, SQL Server, and Tableau. I built scalable data storage solutions using Azure SQL Database, Azure Cosmos DB, and Azure Data Lake Storage, and developed Spark and Databricks-based ETL solutions to transform and analyze large datasets. My work significantly improved data throughput and supported enterprise reporting and analytics initiatives.