Socure is on a mission to verify 100% of good identities in real time and eliminate identity fraud from the internet.
The company uses predictive analytics and advanced machine learning trained on billions of signals to power RiskOS™, creating the most accurate identity verification and fraud prevention platform.
The Senior Data Engineer will design, build, and optimize high-performance data systems that support next-generation identity verification products.
This role requires a passion for big data, graph databases, and scalable architectures, with a strong focus on innovation, data integrity, and security.
Key responsibilities include designing and implementing scalable, secure, and high-performing data pipelines for batch and real-time processing, building and maintaining data systems for machine learning features, and developing production-grade code in Java, Scala, and Python.
The engineer will work with technologies such as Apache Spark, Kafka, Flink, Airflow, AWS EMR, and cloud-native data services on AWS.
Responsibilities also include applying graph data modeling techniques and optimizing data architectures for cost, performance, and maintainability.
The role involves collaborating with cross-functional teams to translate business needs into robust data solutions and ensuring compliance with privacy regulations and PII protection standards.
The engineer will stay current with advances in data engineering and contribute to the evolution of Socure’s data engineering best practices.
Requirements:
A Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related technical field is required.
Candidates must have 7+ years of experience building and supporting complex data systems and applications in cloud environments.
Strong proficiency in Java, Scala, or Python is essential.
Deep knowledge of distributed data processing frameworks such as Spark, Kafka, and Flink is required.
Hands-on experience with AWS cloud services and containerized environments like Docker and Kubernetes is necessary.
Understanding of software design patterns, data structures, and DevOps/CI-CD best practices is expected.
Experience with Airflow or other data pipeline orchestration services is required.
Familiarity with building ML data pipelines using platforms like Databricks or SageMaker is preferred.
Experience in developing and utilizing scalable, high-performance APIs is necessary.
Additional experience with graph databases and graph algorithms is a plus.
Benefits:
Socure offers a diverse and inclusive work environment, valuing diversity of all kinds.
The company is an equal opportunity employer and does not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.