Data Engineer

Full Time
New York, NY, USA
2 weeks ago

We are seeking a highly skilled and motivated Data Engineer to join our team. The ideal candidate will have strong proficiency in Python programming language, extensive experience with Oracle PL/SQL, and a solid understanding of Azure Serverless compute resources. As a Data Engineer, you will be responsible for designing, implementing, and maintaining scalable data pipelines and solutions to support our data-driven initiatives.

 

Responsibilities

  • Data Pipeline Development: Design, develop, and maintain robust data pipelines to extract, transform, and load (ETL) data from various sources into our data warehouse using Python and Azure Serverless compute resources.
  • Data Modeling and Optimization: Work closely with analysts to design and optimize data models for performance and scalability. Utilize Oracle PL/SQL for efficient data querying and manipulation.
  • Database Management: Manage and administer Oracle databases, including schema design, indexing, and performance tuning to ensure data integrity and reliability.
  • Cloud Infrastructure Management: Deploy, configure, and manage Azure Serverless compute resources, such as Azure Functions, Azure Logic Apps and Azure Data Factory, to support data processing tasks and workflows.
  • Monitoring and Maintenance: Implement monitoring solutions to proactively identify and address issues with data pipelines and database performance. Perform regular maintenance tasks to ensure optimal performance and reliability of data infrastructure.
  • Documentation and Collaboration: Document data pipelines, database schemas, and infrastructure configurations. Collaborate with cross-functional teams including data scientists, analysts, and software engineers to understand data requirements and deliver effective solutions following our SCRUM framework.
  • Continuous Improvement: Stay up to date with emerging technologies and best practices in data engineering. Continuously evaluate and recommend improvements to enhance the efficiency, scalability, and reliability of our data infrastructure.

 

Qualifications:

- Bachelor’s or master’s degree in computer science, Engineering, or a related field.

- Proven experience as a Data Engineer or similar role, with a focus on building data pipelines and managing large-scale data infrastructure.

- Strong proficiency in Python programming language, with experience in developing data processing applications and scripts.

- Extensive experience with Oracle PL/SQL, including database design, optimization, and administration.

- Hands-on experience with Azure cloud services, particularly Azure Serverless compute resources (e.g., Azure Functions, Azure Logic Apps, Azure Data Factory).

- Solid understanding of data modeling principles and best practices.

- Excellent problem-solving skills and attention to detail.

- Strong communication and collaboration skills, with the ability to work effectively in a team environment.

 

Preferred Qualifications:

 

- Experience with Oracle cloud infrastructure.

- Knowledge of big data technologies such as Hadoop, Spark, or Kafka.

- Experience with containerization technologies such as Docker and Kubernetes.

- Familiarity with Business Intelligence tools like MicroStrategy.

- Familiarity with data visualization tools such as Power BI or Tableau.