Rubina Naushad Lakhani

Senior Consultant | Azure Data Engineer

Results-driven Azure Data Engineer with 7.8 years of total experience and 3.5 years of relevant experience in designing, developing, and deploying complex data systems on Azure.

7.8 Years Experience
3.5 Azure Experience

About Me

Proven ability to work with large and complex data sets, and expertise in utilizing Azure technologies such as Azure Data Factory, Azure Databricks, Azure Data Lake Storage and Azure SQL Database.

Seeking a role in an innovative organization that values my expertise and allows me to continuously improve and expand my skills.

Complex Data Pipeline Design
Big Data Processing & Analytics
Cloud Data Solutions
Cross-functional Collaboration

Work Experience

Senior Consultant

Capgemini 04/2022 - Present
  • Designed and developed complex data pipelines using Azure Data Factory to process and transfer large data sets from various sources including JSON, REST API, CSV, XLSX, and Parquet to data lake.
  • Worked extensively with Databricks and PySpark for big data processing and analytics.
  • Developed complex data transformation logic using Scala, and performed data cleaning, data enrichment, and data validation using PySpark and SparkSQL.
  • Collaborated with cross-functional teams and business stakeholders to gather specifications and implement data solutions.
  • Utilized Azure Data Lake Storage and Azure Blob Storage to design and implement data lake and data warehousing solutions.
  • Experience in working with Delta Lake on Databricks platform, including configuring and optimizing Delta Lake tables, and performing complex data operations using SQL and PySpark.
  • Experience with Azure DevOps for continuous integration and deployment of ADF and Databricks code.

Technology Analyst

Infosys 06/2020 - 04/2022
  • Worked with other departments and key decision makers to gather requirements and develop data solutions that fulfilled the objectives of the project.
  • Proficient in connecting to various data sources, including Azure Data Lake Storage, Azure SQL Database, API and CSV From Azure Databricks.
  • Created advanced data manipulation techniques using Python, and cleaned, enhanced and validated data using PySpark and SparkSQL.
  • Experience in creating and maintaining interactive dashboards and reports using Power BI.

Senior Systems Engineer

Infosys 06/2016 - 06/2020
  • Worked as an Oracle Fusion Middleware developer with experience in Oracle Data Integrator, Oracle BI Publish and SQL.

Technical Skills

Azure Data Factory

Azure Databricks

PySpark

Scala

SparkSQL

Delta Lake

Azure SQL

Azure Data Lake Storage

Power BI

Certifications & Achievements

Microsoft Certified: Azure Data Engineer Associate

DP-203

11/2022 - 11/2024

Databricks Certified Data Engineer Associate

12/2022 - 12/2024

Microsoft Certified: Azure Data Fundamentals

DP-900

08/2021

Databricks Lakehouse Fundamentals

10/2022

Achievements

XtraMile Certificate

06/2022 - For outstanding performance

Certificate of Excellence: Delivery Ninja

01/2022 - For consistent performance and outstanding commitment to work

Insta Award

2018

Education

Bachelor of Engineering in Information Technology

Muffakham Jah College of Engineering and Technology

2012 - 2016

Get In Touch

I'm always open to discussing new opportunities and interesting projects.