The team's goal is for Storage to make optimal use of the evolving user demands and hardware through resource abstraction, policy direction, system simplification, and algorithmic optimizations. The Data Scientist will work with Engineering teams and other Data Scientists to provide strategic insights and direction, build probabilistic models of the systems, and compare those models with production to identify opportunities. They will also collaborate with stakeholders in cross-projects and team settings to identify and clarify business or product questions to answer, provide feedback to translate and refine business questions into tractable analysis, evaluation metrics, or mathematical models. The Data Scientist will design and evaluate models to mathematically express and solve defined problems with limited precedent. They will gather information, business goals, priorities, and organizational context, as well as the existing and upcoming data infrastructure. The Data Scientist will own the process of gathering, extracting, and compiling data across sources via tools (e.g., SQL, R, Python), and format, re-structure, or validate data to ensure quality, and review the dataset to ensure it is ready for analysis.