⢠BSc or MSc (preferred) in a STEM field ⢠Relevant work experience of 8 years ⢠Fluency in Python (especially Numpy and Pandas) and familiarity in PySpark ⢠Extensive hands-on experience with AWS Analytical Components like S3, EC2, Lambdas, Glue, SQS, SNS, DynamoDB, Redshift, RDS etc. ⢠Experience with Data Lake Formation and Athena. ⢠Work Experience with industry standard distributed systems (ie. Spark, hive), data pipeline tools (ie. Airflow), NoSQL Databases (DynamoDB) and databases (PostgreSQL) ⢠Experience with Data Analysis, Significant experience optimizing data retrieval processes supporting API output, ideally within a low query volume / high data volume environment. ⢠Demonstrably deep experience with relevant âbig dataâ processing either via Spark or through a modern MPP database like Redshift, ideally with experience in both ⢠Demonstrably deep experience with CI/CD tools and practices in a containerized AWS environment, from deployment pipelines (Jenkins, etc), infrastructure definition (Terraform, CloudFormation, etc. ⢠Understand and design for non-functional concerns such as performance, cost optimization, maintainability, and developer experience.
Together, we create the future you always aspired to. Explore your next career opportunity.
SEE ALL OPEN POSITIONS