Data Scientist (m/f/d)

2 Months ago • 5 Years + • Data Analyst

About the job

Job Description

This role requires 5+ years of experience as a Data Scientist with expertise in Python/R, machine learning techniques, and working with large datasets. You'll develop data-driven solutions for the construction industry, analyze data, and collaborate with cross-functional teams.
Must have:
  • Python/R proficiency
  • Machine Learning
  • Data Analysis
  • Large Datasets
Good to have:
  • Snowflake Transformations
  • Azure Data Factory
  • Kafka Ingestion
  • Parquet Files
Perks:
  • Health Days
  • Hybrid Working
Not hearing back from companies?
Unlock the secrets to a successful job application and accelerate your journey to your next opportunity.

About the job

Want to work in a culture built on mutual trust and respect? How about having the freedom to make work fit into your life (and not the other way round)? A career with Thinkproject could be just the opportunity you're looking for.


What do we do?

Thinkproject is a European market-leader in digitalisation tools for construction companies. It sounds complex, but we'll explain further! Construction companies used to use manual administration and physical paperwork for projects (sometimes hundreds of thousands of bits of paperwork for one project!). Using our construction intelligence solutions, businesses can go digital, which benefits everyone from the construction companies to the wider public.

Our mission is to deliver digitalisation to make a safer, healthier and more sustainable AECO (Architecture, Engineering, Construction, Operations) industry. This is a really exciting time to join our company, since our founding in 2000 we have gone from strength-to-strength and have lots of exciting developments coming up soon that you could be a part of.


We are looking for a skilled and motivated Data Scientist (m/f/d) to join our team in India. As a Data Scientist, you will play a critical role in analyzing, interpreting, and deriving valuable insights from vast datasets. You'll work closely with cross-functional teams to develop data-driven solutions that drive innovation and optimization within our software solution.


Location: Pune

Department: R&D

Contract: Permanent


What your day will look like

  • Collect, preprocess, and analyze structured and unstructured data sets related to the construction industry using statistical methods and machine learning techniques.
  • Develop predictive models, algorithms, and data-driven solutions to address business challenges and enhance decision-making processes.
  • Collaborate with software engineers, product managers, and domain experts to integrate analytical solutions into our cloud-based software platforms.
  • Design and implement experiments, tests, and data-driven initiatives to improve product functionalities and user experience.
  • Perform exploratory data analysis to identify trends, patterns, and correlations within construction-related datasets.
  • Communicate findings and insights to both technical and non-technical stakeholders through reports, visualizations, and presentations.
  • Stay updated with the latest advancements in data science, machine learning, and construction technology to drive innovation within the organization.


What you need to fulfill the role


Master's degree in Computer Science, Data Science, Statistics, or a related quantitative field.

5+ yrs of Previous experience in a Data Scientist or similar role, preferably within the software industry or construction domain.

Proficiency in programming languages like Python or R for data analysis, machine learning, and statistical modeling, with expertise in relevant libraries.

Strong understanding of machine learning techniques (supervised/unsupervised learning, regression, clustering, normalization, etc.), along with practical experience using libraries like scikit-learn, TensorFlow, PyTorch, etc.

Hands-on experience working with large datasets, utilizing data visualization tools (especially Power BI), and working with SQL/NoSQL databases.

Excellent problem-solving abilities and adeptness in translating business requirements into data-driven solutions.

Effective communication skills, capable of presenting complex findings in a clear and understandable manner.

Ability to contribute to the productionization of models.

Proficient in creating models from scratch and fine-tuning existing models.

Good understading of Spark SQL and PySpark.

Should be able to contribute in large models management.

Evaluate out-of-box Supervised/Un-supervised/Neural Network Models for their effectiveness on Thinkproject Business challenges

Experience in entire ML application development lifecycle - data preparation, experiment tracking, model result reproducibility and deployment.

Experience of working with both ML Training and Inference pipelines

Experience in using tools like ML Flow for ML development tracking, Apache Spark for deploying ML Production applications etc.

Flexibility to work with both Traditional and Neural network based ML models for use cases spanning NLP, Computer Vision and Tabular Structured data.


Bonus Points for:

Experience with Snowflake transformations and Snowpark.

Knowledge of Azure Data Factory or Kafka ingestion.

Understanding of Parquet file data handling.

Familiarity with MLFlow or Kubeflow


What we offer

Health Days I Lunch 'n' Learn Sessions I Women's Network I LGBTQIA+ Network I Demo Days I Coffee Chat Roulette I Ideas Portal I Free English Lessons I Thinkproject Academy I Social Events I Volunteering Activities I Open Forum with Leadership Team (Tp Café) I Hybrid working I Unlimited learning


We are a passionate bunch here. To join Thinkproject is to shape what our company becomes. We take feedback from our staff very seriously and give them the tools they need to help us create our fantastic culture of mutual respect. We believe that investing in our staff is crucial to the success of our business.


Your contact:

Vikas Gaikwad

Please submit your application, including salary expectations and potential date of entry, by submitting the form on the next page.

View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Pune, Maharashtra, India (Hybrid)

Maharashtra, India (Hybrid)

Pune, Maharashtra, India (Hybrid)

View All Jobs

Get notified when new jobs are added by Thinkproject

Similar Jobs

Bazaar Voice - Senior Data Engineer

Bazaar Voice, United Kingdom (Hybrid)

Nielsen Holdings - Data Scientist

Nielsen Holdings, United States (Remote)

Nielsen Holdings - Senior Full Stack Developer - Mumbai / Bangalore

Nielsen Holdings, India (Hybrid)

Luxoft - Senior PySpark Data Engineer

Luxoft, India (Remote)

Tencent - Product Operations Intern

Tencent, Netherlands (On-Site)

10times - Data Scientist

10times, India (On-Site)

Beghou Consulting - Lead Digital Marketing Analyst

Beghou Consulting, India (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

CloudHire - Senior Data Analyst

CloudHire, India (Remote)

Zuora - Sr Security Engineer

Zuora, India (Hybrid)

Salesforce - Staff Data Scientist

Salesforce, United States (On-Site)

Luxoft - Senior DevOps Engineer

Luxoft, Canada (On-Site)

Playtika - Data Infrastructure Director

Playtika, Israel (On-Site)

OKX - Data Engineer

OKX, Hong Kong (On-Site)

TMRW House of Brands - SDE-II (Frontend - React JS Development) (3yrs+)

TMRW House of Brands, India (On-Site)

Get notifed when new similar jobs are uploaded

Jobs in Pune, Maharashtra, India

Get notifed when new similar jobs are uploaded

Data Analyst Jobs

LeoVegas - Sportsbook Analyst - Data Team

LeoVegas, United Kingdom (On-Site)

BigID - Senior Data Analyst

BigID, Israel (On-Site)

Al-Fahad - Data Annotator

Al-Fahad, India (On-Site)

Twitch - Senior Data Scientist - ML

Twitch, United States (On-Site)

Meta - Data Engineer, Product Analytics

Meta, United States (On-Site)

Poppulo - Senior Data Engineer

Poppulo, India (Hybrid)

Dream11 - Lead ML Scientist

Dream11, India (On-Site)

Get notifed when new similar jobs are uploaded