About the job
SummaryBy Outscal
Senior Data Engineer with 5+ years experience in building and maintaining data systems and machine learning models using Scala. Must have expertise in Apache Spark, Scala microservices, GCP Dataflow, and high-throughput systems.
We’re seeking a Data Engineer to build and maintain our data systems and machine learning models, using Scala to support and enhance our products.
We have primed ourselves since day one to build one of the most resilient and scalable systems to support what we saw as one of the most interesting challenges in our careers. Our system currently processes TBs/s of data with upwards of 2 million writes per second - and as we grow, those numbers are expected to grow with us.
We have also built our infrastructure and systems to allow innovation. If you have an idea to implement, or new technology to try out, this is the right place for you.
As a Data Engineer, you will be working on our core systems, engines, and product services using the most sophisticated cutting-edge technologies to solve challenging problems that are unique to the DEI world.
- Work with Apache Spark batch & real-time streaming to process data at scale
- Work with Scala microservices hosted on K8s (GKE) to support our products
- Work with MlLib and TensorFlow on Vertex AI to solve forecasting and simulations in addition to a variety of machine learning challenges
- Deploy new models and engines for new ideas and analyses
- 5+ years of experience
- Advanced knowledge of different non-relational schema models (Column Family, Graph, Document, Object)
- Experience with advanced topologies and architecture design such as KAPPA and LAMBDA architectures
- GCP Dataflow experience
- Experience in Scala
- Experience in Python or Java
- Experience with high throughput systems
- Strong communication skills and at least a C1 level of English