Build robust, scalable data pipelines using distributed processing frameworks (e.g. Spark, Hadoop, EMR, Flink, Storm), integrated with asynchronous messaging systems (e.g. Apache Kafka, Kinesis, PubSub, MQ Series).
Apply strong programming, algorithmic, and data processing skills, with significant experience producing production-ready code in languages such as Python, Scala, or Java, and engineering experience on machine learning projects such as time series forecasting, classification, and optimization problems.
Work collaboratively with teams to handle relational database management systems (RDBMS) (e.g. PostgreSQL, MySQL) and NoSQL databases (e.g. MongoDB, Elasticsearch), with hands-on experience in the implementation and performance tuning of MPP databases (e.g. Redshift, BigQuery).
The Successful Applicant
5+ years of experience in building out scalable and reliable ETL/ELT pipelines and processes to ingest data from a variety of data sources, preferably in the ecommerce retail industry.
Experience with distributed processing frameworks (e.g. Spark, Hadoop, Flink, Storm), integrated with asynchronous messaging systems (e.g. Apache Kafka, Kinesis, PubSub, MQ Series).
Experience administering and deploying CI/CD tools (e.g. Git, Jira, Jenkins), industrialization tools (e.g. Ansible, Terraform), and workflow management tools (e.g. Airflow, Jenkins, Luigi) in Linux operating system environments.
Experience designing and implementing software for Data Security, Cryptography, Data Loss Prevention (DLP), or other security tools.
Sound business judgment, strong analytical skills, and a proven track record of taking ownership, leading data-driven analyses, and influencing others and results.
Experience with cloud services such as AWS, GCP, or Azure.
E-commerce, logistics, or fashion retail background is a bonus.
What's on Offer
Work with a multinational team
Join one of the most successful ecommerce companies
Flexible hours and competitive benefits
Bachelor's degree in information technology, information systems, computer science, or equivalent.
Demonstrated proficiency in network administration (TCP/IP,…