We are looking for a data scientist:
There are several ongoing projects that will utilize data analytics. These project will analyze sensor data (vibration, temperature, ultrasound, and laser) with the output being both real-time interfaces showing the current state as well as predictive algorithms in order to determine potential downtime or quality issues. The data that will be used for these projects will come from a variety of database systems (IPST, IPS-L, IPS-Q, FMS,…), Internet-of-Things sensors, and video cameras.
The analysis of the data will be done on a big data architecture (Hadoop) system utilizing state of-the-art analytics and machine/deep learning algorithms.
The following technical skill-set is needed in order to aid in this endeavor:
- Data Loading: Loading of data from different data
sources in Hadoop cluster (Hive data warehouse)
- Data cleaning, preparation and integration: It can
be assumed that the data sources can be
connected via a timestamp and/or global
identifiers (e.g. the build number)
- Data labeling (video feeds)
- Investigate analytics approaches:
o Exploratory Data Analysis and Dashboards (e.g.
Jupyter, Tableau, QlikView)
o Predictive modeling and machine learning tools,
such as Scikit-Learn, R, Spark MLlib
- Prototype and build models for understanding
failure modes and predictions:
o Offline and Online validation
Location/Region: Greenville, SC (US)