Summary

I am a Data Engineer with a strong track record in both corporate and freelance environments, specializing in data integration, system migration, and process optimization. I currently work at Blutech Consulting on a major project for Allied Bank, migrating its systems from Oracle to Cloudera. My role includes converting Oracle views and tables to Cloudera-compatible views and tables, translating packages and procedures into PySpark functions, and consolidating data using Hive, Impala, and PySpark on Cloudera. This work keeps data flowing reliably between systems and improves data accessibility and accuracy.


Previously, I worked as a Data Engineer at Huawei, where I integrated third-party data into the Huawei ecosystem and optimized data flow using Huawei's internal tools, IDE and Smartcare. My proficiency in programming languages like Groovy, Java, and Python allows me to design effective data integration solutions and streamline complex data processes.


In addition to my corporate roles, I am an experienced freelancer on platforms like Fiverr and Upwork, with over 100 completed projects across diverse domains.


I am committed to delivering high-quality solutions and continuously advancing my skills to excel in every project I undertake. My passion for data engineering and proven ability to adapt to different challenges make me a valuable asset in any data-driven environment.

Projects

Telecom Churn Prediction Model

Work Experience

Data Engineer
Blutech Consulting
Feb 2024 - Present | Islamabad, Pakistan

As a Data Engineer at Blutech Consulting, I am actively engaged in a high-impact project for Allied Bank, focusing on migrating legacy systems from Oracle to Cloudera. My responsibilities include converting Oracle views into Cloudera-compatible views, translating Oracle packages and procedures into efficient PySpark functions, and performing data consolidation to ensure seamless data integration and accuracy.
In this role, I have honed my expertise in Hive, Impala, PySpark, and Cloudera, and I have leveraged MobaXterm to facilitate data migration processes. My work involves handling large datasets, optimizing complex queries, and ensuring data consistency across platforms. I am committed to delivering high-quality, scalable solutions that meet Allied Bank’s business needs, streamline operations, and improve data accessibility.
My experience in this project demonstrates my ability to manage and execute end-to-end data migration and transformation tasks within a big data ecosystem, showcasing my skills in both technical and strategic aspects of data engineering.

Freelancer
Fiverr
Oct 2020 - Present | Taxila, Pakistan

I provide writing, subtitling, transcription, and virtual assistant services on freelancing platforms such as Fiverr and Upwork. I have completed more than 100 projects on Fiverr, where I am a Level Two Seller. I also work with a client outside of Fiverr.

Data Engineer
Huawei Technologies
Sep 2022 - Feb 2024 | Islamabad, Pakistan

Key Responsibilities:


Data Extraction and Ingestion: Collaborate with Telecom companies to extract their data and ensure its accurate ingestion into Huawei's data ecosystem. Develop efficient data extraction processes to gather diverse data sets.


Data Transformation: Utilize Groovy and Java programming languages to design and implement robust data transformation processes. Cleanse, enrich, and format raw data to align with Huawei's data models and standards.


ETL Pipeline Development: Design, develop, and maintain end-to-end ETL pipelines that automate data movement and transformation. Optimize these pipelines for performance, reliability, and scalability.


Quality Assurance: Implement data quality checks and validation procedures to identify anomalies and discrepancies in the incoming data. Ensure data accuracy and consistency throughout the ETL process.


Custom Tool Integration: Utilize Huawei's internal software IDE, Smartcare, MobaXterm, and IntelliJ to create custom tools and scripts that enhance the efficiency of ETL processes. Collaborate with cross-functional teams to integrate these tools seamlessly.


Troubleshooting and Optimization: Identify and resolve issues within ETL pipelines, ensuring smooth and uninterrupted data flow. Continuously optimize ETL processes for speed, efficiency, and resource utilization.


Collaboration: Collaborate with Telecom company partners, data analysts, data scientists, and other stakeholders to understand their data requirements and provide timely data solutions.


Documentation: Maintain comprehensive documentation for ETL processes, including data mappings, transformations, and pipeline architecture. Ensure that knowledge is shared effectively across the team.


Continuous Learning: Stay updated with industry trends, best practices, and emerging technologies related to ETL, data integration, and programming languages. Apply new knowledge to enhance ETL processes and solutions.


Qualifications and Skills:

Strong proficiency in Linux and in the Groovy and Java programming languages.
Experience using Huawei's internal software IDE, Smartcare, MobaXterm, and IntelliJ.
Solid understanding of ETL concepts and data integration techniques.
Familiarity with Telecom industry data formats and standards is a plus.
Problem-solving mindset with the ability to diagnose and resolve technical issues.
Detail-oriented approach to data quality and accuracy.
Effective communication skills to collaborate with diverse teams and stakeholders.
Ability to work independently and in a collaborative team environment.
Approximately one year of professional experience as an ETL Developer, with demonstrated accomplishments in data integration and transformation.

As an ETL Developer at Huawei, I was committed to ensuring the smooth flow of data from telecom companies to Huawei's systems, contributing to the company's mission of driving innovation in the telecommunications industry through advanced data solutions and insights.

Education

HITEC University, Taxila
Bachelor's degree, Bachelor of Science
Computer Science
CGPA 3.3/4
2022

Skills

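Intermediate ATS Knowledge
Beginner Business Analytics
Beginner Business Intelligence
Beginner Data Analytics
Beginner Data Visualization Skills
Intermediate ETL
Intermediate Groovy
Intermediate Hadoop
Intermediate Linux
Intermediate Power BI
Intermediate Python Knowledge
Intermediate SQL
Intermediate Tableau
Intermediate Testing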

Languages

Fluent Urdu
Intermediate English