We are looking for a results-driven Senior Data Scientist to design and implement innovative data solutions. You will work on complex datasets, build advanced ETL/ELT pipelines, and apply big data technologies to support scalable analytics and application development.
Key Responsibilities:
· Design and implement efficient ETL/ELT pipelines using tools such as Apache Nifi, AWS Glue, and Talend.
· Extract, transform, and load data from diverse sources, ensuring accuracy and consistency.
· Utilize Python libraries (e.g., Pandas, NumPy, PySpark) to analyze, process, and visualize data.
· Develop machine learning models to generate insights and improve business decision-making.
· Work with frameworks like Hadoop, Spark, and Kafka for processing large-scale datasets.
· Optimize workflows on cloud-based platforms (e.g., AWS EMR).
· Support real-time or batch data migration efforts using tools like Kafka, Spark Streaming, or AWS DMS.
· Ensure seamless data migration between source and target systems.
· Write and optimize SQL queries for relational databases (e.g., MySQL, PostgreSQL, MS SQL Server).
· Work with NoSQL databases (e.g., MongoDB) for non-relational data storage and analysis.
Key Qualifications:
· Bachelor’s or Master’s degree in Computer Science, Data Science, or a related field.
· 5+ years of experience in data science and analytics.
· Proficiency in Python, SQL, and ETL/ELT processes.
· Hands-on experience with big data technologies (Hadoop, Spark, Kafka).
· Familiarity with cloud platforms like AWS, Azure, or GCP.
· Strong analytical and problem-solving skills with a passion for working with data.