The team's goal is for Storage to make optimal use of the evolving user demands and hardware through resource abstraction, policy direction, system simplification, and algorithmic optimizations. You will work with Engineering teams and other Data Scientist to provide strategic insights and direction, build probabilistic models of the systems, and compare those models with production to identify opportunities. You will collaborate with stakeholders in cross-projects and team settings to identify and clarify business or product questions to answer, provide feedback to translate and refine business questions into tractable analysis, evaluation metrics, or mathematical models. You will also design and evaluate models to mathematically express and solve defined problems with limited precedent. You will gather information, business goals, priorities, and organizational context, as well as the existing and upcoming data infrastructure. You will also own the process of gathering, extracting, and compiling data across sources via tools (e.g., SQL, R, Python), and format, re-structure, or validate data to ensure quality, and review the dataset to ensure it is ready for analysis.