Data Engineer
Job Overview
-
Date PostedFebruary 20, 2025
-
Company Location
-
Expiration dateMarch 22, 2025
-
Experience2-5 Years
-
GenderBoth
-
Company NameAlgoscale
Job Description
Job Summary:
We are seeking a skilled and motivated Data Engineer to join our team. The ideal candidate will design, develop, and maintain scalable data pipelines, optimize data storage solutions, and ensure efficient data integration across multiple systems. You will play a critical role in building the data infrastructure that supports our business intelligence, analytics, and machine learning initiatives.
Key Responsibilities:
Design, build, and maintain scalable and reliable ETL/ELT pipelines to ingest and process large datasets from various sources.
Develop and manage data warehouses, lakes, and databases to store and structure data efficiently.
Collaborate with data analysts, data scientists, and software engineers to optimize data workflows.
Implement and manage data governance, security, and compliance best practices.
Monitor data performance and troubleshoot issues to ensure high availability and reliability.
Work with cloud-based data platforms (AWS, Azure, GCP) to develop and optimize cloud data solutions.
Create and maintain documentation for data pipelines, workflows, and data models.
Required Qualifications:
Bachelor’s or Master’s degree in Computer Science, Data Engineering, Information Systems, or a related field.
3+ years of experience in data engineering or a related role.
Strong proficiency in SQL and experience with relational and NoSQL databases (e.g., PostgreSQL, MySQL, MongoDB, BigQuery).
Experience with data pipeline and workflow management tools (e.g., Apache Airflow, Luigi, AWS Glue).
Hands-on experience with big data technologies such as Hadoop, Spark, or Kafka.
Strong experience with Apache Spark and AWS Glue for data processing.
Proficiency in Python, Scala, or Java for data processing and automation.
Familiarity with cloud platforms (AWS, Azure, GCP) and their data-related services.
Strong understanding of data modeling, warehousing, and ETL processes.
Experience with CI/CD practices for data engineering and deployment.
Excellent problem-solving skills and ability to work in a collaborative team environment.
Preferred Qualifications:
Experience with real-time data streaming and processing technologies.
Knowledge of machine learning pipelines and AI-driven data solutions.
Certification in cloud data services (AWS Certified Data Analytics, Google Cloud Professional Data Engineer, etc.).
Familiarity with containerization and orchestration (Docker, Kubernetes).
What we offer:
Opportunity to collaborate with leading companies across the globe.
Opportunity to work with the latest and trending technologies.
Competitive salary and performance-based bonuses.
Comprehensive group health insurance.
Flexible working hours and remote work options.
Generous vacation and paid time off.
Professional learning and development programs and certifications.