Crawler Analyst

2 Months ago • All levels

Job Summary

Job Description

As a Crawler Analyst, you will be responsible for automating information extraction from websites using an internal tool. This involves collaborating with the Bots (Python) team to address website-specific challenges, updating templates (YAML files) to keep extraction working, validating existing templates, and providing feedback to improve the tool. You will also analyze system logs to identify and troubleshoot errors. The role requires strong analytical and problem-solving skills, as well as the ability to work both independently and as part of a team. A good understanding of HTML, XPath, regex, and Linux environments is required, along with basic knowledge of Docker and Agile methodologies. An Upper-Intermediate level of English is required.
Must have:
  • Experience in scraping, scripting, programming, or coding; knowledge of HTML, XPath, and regex is required.
  • Experience working with Linux environments (grep, awk, etc.).
  • Basic knowledge of using Docker or similar tools.
  • Familiarity with Agile methodologies (Scrum, Kanban, Jira Suite...)
  • Knowledge of Git as a version control system.
  • Upper-Intermediate level of English proficiency.
Good to have:
  • More experience in Python, which will help you detect and solve problems more easily.
  • Some experience working with proxies.
  • Ability to find solutions by observing logs.
Perks:
  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits

Job Details

As our Crawler Analyst, you will use our internal tool to automate information extraction from websites. Every site has its own challenges, so you will collaborate with the Bots (Python) team (and occasionally other teams and departments) to solve them as you learn, and with the developers of the tool to propose improvements. You will also check system logs to detect possible errors in production and determine whether something is not well defined on the development side.

Responsibilities

  • Automate information extraction for hundreds of websites using custom templates.
  • Keep extraction procedures working as websites change by updating templates (YAML files).
  • Validate current templates and escalate the more complex ones to the Python development team.
  • Think about elegant, reliable, long-term solutions.
  • Provide feedback about the tool so that we can improve it and make the job easier over time.


Your qualifications:

  • Have strong analytical and technical problem-solving skills.
  • Have strong communication and organizational skills.
  • Be able to work well in a team and autonomously.
  • Be flexible and able to adapt easily to changes and learn quickly.
  • Be able to learn new skills and tools using documentation and examples.
  • Be proactive and propose solutions to detected problems, and think about how to improve the process.


Requirements:

  • Experience in scraping, scripting, programming, or coding is required, especially good knowledge of HTML, XPath, and regex.
  • Experience working with Linux environments (grep, awk, etc.).
  • Basic knowledge of using Docker or similar tools.
  • Familiarity with Agile methodologies (Scrum, Kanban, Jira Suite...)
  • Some familiarity with Python, although you will not use it frequently.
  • Knowledge of Git as a version control system.
  • Upper-Intermediate level of English (we are an international team, and the official language in the office is English).

Nice to have:

  • More experience in Python, which will help you detect and solve problems with our tool more easily.
  • Some experience working with proxies.
  • Ability to find solutions by observing logs.

We offer*:

  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits

*not applicable for freelancers
