Senior Software Engineer - Ingestion
P-1403
At Databricks, we are passionate about enabling data teams to solve the world's toughest problems, from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world's best data and AI infrastructure platform so our customers can use deep data insights to improve their business. Founded by engineers and customer obsessed, we leap at every opportunity to solve technical challenges, from designing next-gen UI/UX for interfacing with data to scaling our services and infrastructure across millions of virtual machines. Databricks Mosaic AI offers a unique data-centric approach to building enterprise-quality Machine Learning and Generative AI solutions, enabling organizations to securely and cost-effectively own and host ML and Generative AI models, augmented or trained with their enterprise data.
The impact you will have:
As a Senior Software Engineer working on the Lakeflow Connect product, you will be part of a team whose mission is to bring all of the world's data into Databricks. You will build a set of new native, highly scalable connectors for databases and enterprise applications that enable our customers to ingest structured, semi-structured and unstructured data, improve their productivity and move faster from data to insights. You will achieve this by being customer obsessed, being hands-on in building the product (you love coding!) and collaborating closely with the product team. You’ll work on challenges such as:
- Building reliable connectors with zero data and precision loss
- Scaling the connectors to support ingesting millions of rows per day
- Building incremental ingestion and reducing the end-to-end latency from the time data appears in a source to the time it is available in Delta Lake
- Enabling end-to-end monitoring and observability for long-running ingestion workflows
What we look for:
- BS (or higher) in Computer Science or a related field.
- 6+ years of industry experience writing production code in Java, Scala or C++.
- Experience working with data ingestion and processing systems.
- Experience with SQL and/or other database technologies, change data capture (CDC) and data connectors.
About Databricks
Databricks is the data and AI company. More than 10,000 organizations worldwide, including Comcast, Condé Nast, Grammarly, and over 50% of the Fortune 500, rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe, and was founded by the original creators of Lakehouse, Apache Spark™, Delta Lake and MLflow. To learn more, follow Databricks on Twitter, LinkedIn and Facebook.
Benefits
At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visit https://www.mybenefitsnow.com/databricks.
Our Commitment to Diversity and Inclusion
At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics.
Compliance
If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.