Daily activities

The Data Scientist must be able to communicate effectively and have detailed knowledge of data preparation and cleaning, algorithm selection and design, results analysis, and industrialization.

The data scientist is solution-oriented, curious, open-minded, autonomous and willing to work in a team. He / she is able to take the initiative to propose and find innovative solutions.

The data scientist provides expertise on mathematical and statistical concepts for the broader team.

The selected candidate will be actively involved in the research aspects of developing and applying advanced data analytics, machine learning, and AI techniques and algorithms to solve challenging problems with real-world impact. You will create business hypotheses, propose new metrics for qualitative and quantitative assessment of model performance, and research and design data analytics visions.

You will:

  • describe, explain and document the results of the data analysis
  • audit data quality and document the data extraction and transformation needs
  • perform data cleaning and wrangling, and collaborate to integrate targeted data into the data lake
  • propose, test and select the best machine learning algorithms using a range of data science tools and statistical methodologies to discover, describe and predict
  • participate in the specification and implementation of industrialization
  • lead the functional and technical analysis of customer needs regarding analytics

Our expectations:

  • Ph.D. in Statistics, Mathematics or a similar field, or 4 years of experience in Machine Learning, Data Mining, or Predictive Analytics
  • demonstrated capability to develop proof-of-concept prototypes for experimenting with novel algorithms
  • excellence in analyzing large, complex, multi-dimensional data sets with a variety of analytic methods (math, statistics, machine learning, etc.)
  • strong algorithmic problem-solving skills and software development skills (R, Python, or similar)
  • familiarity with database technologies and query languages
  • strong communication skills in business English (both written and spoken)

Nice to have:

  • demonstrated experience and accomplishments in using data mining, machine learning, and predictive analytics to address real-life problems
  • experience analyzing large-scale data and working with distributed algorithms
  • solid knowledge of statistics
  • knowledge of machine learning algorithms
  • Python (sklearn, pandas, numpy, etc.) or R data science libraries
  • ability to investigate and resolve issues in large data sets
  • MOOC certifications (Coursera, Stanford, etc.) are a plus
