Azure Data Architect
Crowell & Moring · Full-time · Feb 2025 – Present · Delmar, NY · Remote
- Developed an ETL/ELT data warehouse solution on Azure Synapse and Microsoft Fabric.
- Used PySpark notebooks for data processing and analysis.
- Created Dedicated SQL Pool Stored Procedures to optimize data retrieval and management.
Senior Data Engineer
VillageMD · Full-time · Jul 2022 – Jan 2024 · 1 yr 7 mos · Delmar, NY · Remote
Senior Data Engineer for the Payer Data team. Responsible for designing and implementing a cloud-based data warehouse solution.
- Designed highly parameterized Azure Synapse pipelines that consolidate data to a centralized Data Warehouse / Delta Lakehouse hybrid solution consisting of Star Schema models.
- Implemented an Azure-based Delta Lakehouse with PySpark in Synapse Notebooks.
- Applied a Bronze-Silver-Gold Data Lake architecture.
- Implemented Data Profiling in Python/PySpark.
- Managed version control in Azure DevOps.
- Worked on migrating Azure Synapse–based transformations to dbt.
- Worked on migrating the Delta Table–based Data Lakehouse to Snowflake.
Senior Data Engineer
The Estée Lauder Companies Inc. · Contract · Jan 2022 – Jul 2022 · 7 mos · New York
Responsible for data ETL, augmentation, and profiling as well as corresponding documentation in Confluence. Used Azure Databricks to connect to SAP HANA Views, Synapse Views, Data Lake files and Delta files, and various internal and external APIs to prepare data for visualizations and Data Science work.
- Data Profiling of various data sources to evaluate data consistency and integrity (Azure Databricks, Delta Lake, PySpark, Pandas).
- Queried APIs to augment existing data (Databricks, PySpark, REST API).
- Created and queried Graph Databases to calculate Jaccard similarity, cosine similarity, PageRank, etc. (Neo4j, Cypher).
- Wrote an automated QA script that tests the aggregated output of a dashboard against the underlying unaggregated data (Databricks, PySpark).
- Data Governance and Data Architecture design of future state data solutions.
Full Stack BI Developer
Kaiser Permanente · Contract · Apr 2019 – Dec 2021 · 2 yrs 9 mos · Albany, NY
Primarily a data engineer on KP's COVID-19 reporting team.
- Built and maintained SSAS tabular models for Kaiser Permanente's COVID-19 Financial and COVID-19 Executive dashboards. Work included creating Measures, Calculated Columns, Relationships, Hierarchies, Perspectives, and Derived Dimensions (Visual Studio, SSDT, SSAS, DAX).
- Created and maintained visualizations for Kaiser Permanente's President's dashboard. Wrote DAX formulas for complicated features and nonstandard functionality. Coordinated the design with tabular model development to maximize loading speed (Power BI, DAX).
- Designed, built and maintained COVID-19 daily reporting Data Warehouse in Azure SQL Server. Migrated subset of Data Warehouse to Azure Synapse (Dedicated SQL Pool, Serverless SQL Pool).
- Built ETL from an on-premises Oracle database to an Azure SQL Server database (T-SQL, Python).
- Developer on Agile team participating in PI Planning, Stand-ups, Backlog Refinement, Retrospectives (Agile).
Senior Business Intelligence Analyst
McKesson · Oct 2016 – Apr 2019 · 2 yrs 7 mos · Albany, NY Metropolitan Area
Responsible for creating and maintaining Tableau dashboards, VBA tools, and SharePoint websites for regulatory and compliance teams. Also queried SAP BW and prepared reports for legal. Created and maintained tools for expediting procedures and ETL from multiple internal databases (SAP HANA, Netezza, AS/400, MS SQL Server).
- Worked with Directors of Regulatory Affairs to develop and maintain dashboards and tools to support regulatory monitoring of suspicious orders for small and medium-sized pharmacies (Tableau, VBA, SQL, SAP HANA, SAP BW).
- Created and maintained SharePoint sites for regulatory affairs tools and updates.
- Maintained and supported interactive DEA audit tool and dosage calculators (IBM Cognos, Excel VBA).
- Ran SQL queries and generated reports (SQL, SAP BW). Analyzed purchasing and omit data using R for future tool development.
- Developed and maintained SAP queries using Business Explorer (BEx) Query Designer.
Senior Engineer
IEEE GlobalSpec · Sep 2000 – Oct 2016 · 16 yrs 2 mos · Albany, NY
- Designed and maintained frontend search forms, dashboards, and backend database tables for IEEE's company directory and product search engine.
- Designed and maintained queries, reports, and dashboards for multiple internal departments and external clients using SQL, Tableau, Power BI, Excel VBA, and R.
- Developed and maintained over 100 database tables and search forms responsible for more than 20% of website traffic and leads. Coordinated with clients on table and search form revisions resulting in improved lead quality.
- Extracted, transformed, and loaded (ETL, SSIS) client catalog data feeds into IEEE GlobalSpec database tables.
- Analyzed database traffic to streamline product tables. Removed low-traffic, redundant, and error-prone fields, resulting in a 30% reduction in data entry costs with minimal impact on user experience.
- Designed an algorithm based on the Z-score of multivariate lognormal distributions that flagged errors in a 150-million-part database. Successfully identified data entry errors with minimal false positives. After parameter tuning, identified errors in clients' catalogs and product literature.
- Studied the search patterns of over 5 million registered users to determine product category preferences with respect to location, industry, and job function. The analysis was used to improve ad placement on the website and in newsletters. Oversaw creation of an industry report product based on the identified search patterns.
- Developed a "best guess" unit conversion tool for the product database that flags potential data entry mistakes in real time. The tool was then used to create a semi-automated data feed for IEEE's product database.
- Wrote user manuals and technical documentation and provided technical support.
Instructor – Data Analytics Boot Camp
ASPE · Jan 2013 – Dec 2015 · 3 yrs · Albany, NY Metropolitan Area
Trained professionals from a variety of backgrounds in various data analysis tools and techniques.
- Relational Databases, Access, and SQL
- NoSQL, MongoDB
- Pivot Tables and VBA in Excel and Access, Publishing to SharePoint
- Extract, Transform, Load (ETL) using SSIS
- R Programming, Tableau, and Power BI for Data Visualization
- Statistical Methods – Regression, Bayesian Inference, Machine Learning