Sr Data Engineer
Founded in 2012, Socure is the leader in high-assurance digital identity verification technology. Named to Forbes’ 2019 AI 50 list as one of America’s most promising AI companies, and a recent winner of API World’s Best Data API, Socure’s technology applies artificial intelligence and machine learning techniques with trusted online/offline data intelligence from email, address, phone, IP, social media and the broader Internet to verify identities in real-time. Customers include three of the top five U.S. banks, seven of the top 10 U.S. card issuers, as well as the majority of leading digital banks, lenders and insurers across the U.S. We are funded by some of the world's best investors and entrepreneurs including Scale Venture Partners, Commerce Ventures, Work-Bench, Santander InnoVentures and Two Sigma Ventures
The only way we can further our mission of becoming the single, trusted source of identity verification and eliminating identity fraud is by building the best team on the planet. This is where you come in!
Socure is looking for a Principal Data Engineer to join our US engineering team and lead our data platform initiative.
In our mission to become the single, trusted source of identity verification and eliminate identity fraud from the internet, data is at the core of what we build. It’s how we innovate and how we offer the most accurate Identity Verification on the market. With the company growing very fast and our customer needs even faster, the only way for us to succeed in our mission is to significantly scale how we work with data.
We are in the early days of designing a data platform to accelerate all our data operations and unlock the creation of our future products, and we’d love you to join us and lead the way!
What You'll Do:
- You will work in close collaboration with our Engineering, Data Science, Infrastructure and Product teams to define the strategy and roadmap of our data platform.
- Own the end-to-end delivery of all projects related to our data platform initiative, from conception and design to development and production monitoring.
- Enable a wide team of Data Scientists to perfect our products and expand our offering and offer easy and secure access to data for engineering teams to deliver faster.
- You will democratize access to data and aim to automate operations of large amounts of data efficiently, securely and reliably.
- You are comfortable owning strategic initiatives end to end and working cross-functionally to ensure technical alignment.
- You like to think at scale and design, develop and operate cloud production data stores, pipelines and services that meet goals of low latency, high availability, resiliency, security and quality. And you develop these with an empathy for how they will be used and the people who will use them.
- You have experience designing data pipeline systems, ETLs and setting up large scale datastores and have used technologies like: Hadoop, HBase, S3, Kafka, Spark, DynamoDB, Hudi or Delta Lake, Elasticsearch.
- You use your technical experience to educate your peers in data engineering technologies, best practices and platform thinking.
- Exposure or familiarity of data privacy & regulations eg. GDPR, CCPA (CA) or PII.
- 6+ years of practical experience in building high scale, production distributed systems.
- Competitive base salary
- Equity - every employee is a stakeholder in our upside
- Medical, dental and vision benefits for employees and their dependents
- Parental leave and fertility support
- Flexible PTO
- 401K with company match
- Stipend to supply your home office
- Annual professional development stipend