Data Engineer II

Vollzeit
vor 8 Monate

Location: Canada

About the team:

Our mission as a Data Analytics & Engineering organization is to enable Hashicorp to leverage data as a strategic asset by providing reliable, scalable, and efficient data solutions. Our ultimate goal is to empower our stakeholders to make informed, data driven decisions, and achieve critical business objectives. We are seeking a mid-level engineer to join our team! 

In this role you can expect to:

  • Oversee and govern the expansion of existing data architecture and the optimization of data query performance via best practices. The candidate must be able to work independently and collaboratively.
  • Develops and maintains scalable data pipelines and builds out new API integrations to support continuing increases in data volume and complexity.
  • Collaborates with analytics and business teams to improve data models that feed business intelligence tools, increasing data accessibility and fostering data-driven decision making across the organization.
  •  Implements processes and systems to monitor data quality, ensuring production data is always accurate and available for key stakeholders and business processes that depend on it.
  •  Writes unit/integration tests, contributes to engineering wiki, and documents work.
  •  Performs data analysis required to troubleshoot data related issues and assist in the resolution of data issues.
  •  Designs data integrations and data quality framework.
  •  Designs and evaluates open source and vendor tools for data lineage.
  •  Works closely with all business units and engineering teams to develop strategy for long term data platform architecture.
  •  Develop best practices for data structure to ensure consistency within the system

You may be a good fit for our team if you have:

  • Bachelor's or Master's in computer engineering, computer science or related area.
  • Experience in developing and deploying data pipelines, preferably in the Cloud
  • Minimum 2 years of experience with snowflake- snowflake SQL, Snow pipe, streams, Stored procedure, Task, Hashing, Row Level Security, Time Travel etc.
  • Hands on experience with Snowpark and App development with Snowpark and Stream lit.
  • Strong experience in ETL or ELT Data Pipelines and various aspects, terminologies with Pure SQL like SCD Dimensions, Delta Processing etc.
  • Working with AWS cloud services - S3, Lambda, Glue, Athena, IAM, CloudWatch, 
  • Hands-on experience in API ( Restful API) development and maintenance with Cloud technologies( Like AWS API Gateway, AWS lambda etc).
  • Experience in creating pipelines for real time and near real time integration working with different data sources - flat files, XML, JSON, Avro files and databases
  • Fluent in Python/Go language to be able to write maintainable, reusable, and complex functions for backend data processing. Front development with python is good to have but not necessary.
  • Strong written and oral communication skills with the ability to synthesize, simplify and explain complex problems to different audiences.

#LI-Remote