Senior Data Engineer

Job Tags

Industry

Role: Senior Data Engineer

Reports to: Senior Director, Infrastructure

Department: Engineering

Location: Remote, US

Job Type: Full Time, Exempt

 

Help us Shape the Future of Data

With more than 40 million users, Anaconda is the world’s most popular data science platform and the foundation of modern AI development. We pioneered the use of Python for data science, champion its vibrant community, and continue to steward open-source projects that make tomorrow’s innovations possible. Our enterprise-grade solutions enable corporate, research, and academic institutions around the world to harness the power of open source for competitive advantage, groundbreaking research, and a better world.

Anaconda is seeking people who want to play a role in shaping the future of enterprise machine learning and data science. Candidates should be knowledgeable and capable, but always eager to learn more and to teach others. Overall, we strive to create a culture of ability and humility and an environment that is both fast-paced and focused. We stress empathy and collaboration with our customers, open-source users, and each other. 

Here is why people love most about working here: We’re not just a company, we’re part of a movement. Our dedicated employees and user community are democratizing data science and creating and promoting open-source technologies for a better world, and our commercial offerings make it possible for enterprise users to leverage the most innovative output from open source in a secure, governed way.

Summary

Anaconda is seeking a talented Senior Data Engineer to join our rapidly-growing company. This is an excellent opportunity for you to leverage your experience and skills and apply it to the world of data science and machine learning.

 

What You’ll Do:

  • Create and manage tooling and infrastructure for Anaconda’s data platform.
  • Identify and implement process improvements: designing infrastructure that scales, automating manual processes, etc.
  • Drive database design and the underlying information architecture, transformation logic, and efficient query development to support our growing data needs.
  • Implement testing and observability across the data infrastructure to ensure data quality from raw sources to downstream models.
  • Write documentation that supports code maintainability.
  • Take ownership of the various tasks that will allow us to maintain high-quality data; ingestion, validation, transformation, enrichment, mapping, storage, etc
  • Work closely with Product teams to anticipate and support changes to the data.
  • Work with Strategic Operations and Platform teams to build reliable, scalable tooling for analysis and experimentation.
  • Values collaboration and is very comfortable with pair programming

 

What You Need:

  • 6+ years of relevant experience as a data engineer or significantly related work
  • Foundation & proficiency in Python 
  • Experience in building, optimizing, and maintaining data architectures 
  • Experience building ELT pipelines 
  • Experience with Airflow, Prefect, or other orchestration tools
  • Cloud experience, i.e. AWS, Azure, GCP
  • Experience with Infrastructure as code, Terraform or CloudFormation, Ansible
  • Database experience with relational and non-relational data stores
  • Experience working with large data sets, and an understanding of how to write code that leverages the parallel capabilities of Python and database platforms
  • Strong knowledge of database performance concepts like indices, segmentation, projections, and partitions
  • Experience leading projects with Engineering and Product teams from start to finish
  • Team attitude: “I am not done, until WE are done”
  • Embody our core values:  
    • Ability & Humility
    • Innovation & Action
    • Empathy & Connection
  • Care deeply about fostering an environment where people of all backgrounds and experiences can flourish

What Will Make You Stand Out:

  • Experience with Kafka or other event-streaming technologies
  • Experience with Snowflake
  • Experience working in a fast-paced startup environment
  • Experience working in an open-source or data science-oriented company

 

Why You’ll Like Working Here:

  • Unique opportunity to translate strong open-source adoption and user enthusiasm into commercial product growth
  • Dynamic company that rewards high performers
  • On the cutting edge of enterprise application of data science, machine learning and AI
  • Collaborative team environment that values multiple perspectives and clear thinking
  • Employees-first culture
  • Flexible working hours
  • Medical*, Dental*, Vision*, HSA*, Life* and 401K*
  • Paid parental leave – both parents
  • Monthly productivity stipend 
  • Pre-IPO stock options
  • Open vacation policy* 
  • Quarterly Snake days (company-wide bonus day off)
  • 100% remote

*FTE employees based on region 

An Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or protected veteran status and will not be discriminated against on the basis of disability.

Anaconda, Inc. (“We”, “Us”) are committed to protecting and respecting your privacy. This Privacy Notice sets out the basis on which the personal data collected from you, or that you provide to Us, will be processed by Us in connection with Our recruitment processes. By clicking “Submit Application”, you acknowledge you have read our Privacy Policy and that Anaconda can retain your application data for up to 1-year, unless otherwise stated.  For the purpose of the General Data Protection Regulation (“GDPR”) ”) and the version of the GDPR retained in UK law (the “UK GDPR”) the Data Controller is Sydney Artt.

Ouindex 2024 © All rights reserved