Job Title: ML Eng. /Data Science Python Developer
Location: Sant Cugat, near Barcelona, Spain – Hybrid work – 2 days in a week at office.
Permanent
Job Description:
In this role, you will:
Improve our internal Python library used for model deployments by adding new features, cleaning up code, optimizing performance, and expanding unit testing.
Develop our Optimization Library from a proof of concept to a fully standardized tool for deploying optimization.
Standardize and streamline the data pipelines used for our forecast models.
Responsibilities:
Develop a detailed plan for refactoring/optimizing the code in Python while ensuring its functionality and performance are maintained
Write well-structured, efficient, and maintainable Python code, following best practices and company coding standards
Thoroughly test the new Python code, identify and resolve any bugs or issues, and ensure the results are consistent with the original implementation
Create clear documentation for the migrated Python code including comments, user guides, and technical specifications
Standardize and streamline the data pipelines used for our forecast models in PAT while defining and recommending the team best practices.
Communicate progress and challenges effectively with the project team and stakeholders
Required Skills:
A bachelor’s or master’s degree in computer science, Data Science, or a related field
At least 8 years of experience in a related role
Knowledge of algorithms, data structures, and best practices in software development
Strong programming skills in Python and experience with production grade python code
Deep knowledge of Python libraries such as Scikit-learn, PySpark, NumPy, Pandas
Familiarity with Python’s unit testing framework (currently using pytest)
Experience collaborating and using Github
Excellent communication and teamwork skills, with a strong attention to detail
Prior experience working with Databricks
Preferred Skills:
Prior experience with operationalization of ML models
Prior experience with ML models and collaborating with DS teams.
Familiarity with R programming language. This role might require to migrate some R code to Python/PySpark.
Experience with production grade pipelines and ideally with Databricks Delta Live tables or any other declarative pipelines
Knowledge of pricing algorithms and demand forecasting models