Senior Data Extraction Engineer

2 Months ago • All levels • Data Analysis

Job Summary

Job Description

The Senior Data Extraction Engineer will join Razer on a global mission to revolutionize gaming. The role involves designing, developing, and deploying web scraping solutions to collect datasets for AI training, building scalable web crawlers, ensuring data accuracy and compliance, cleaning and organizing scraped data, monitoring crawler performance, collaborating with AI teams, and documenting workflows. The responsibilities also include designing, developing and deploying web crawler solutions to collect specific datasets for AI model training and building robust and scalable crawlers to extract structured and unstructured data.
Must have:
  • Bachelor's or master’s in computer science or related field.
  • Experience with web scraping tools and frameworks.
  • Proficiency in programming languages.
  • Familiarity with HTTP protocols and JSON data formats.
  • Knowledge of database systems for data storage.
  • Experience with cloud platforms and containerization tools.
  • Strong understanding of web crawling ethics and regulations.
  • Excellent analytical skills and attention to detail.
Good to have:
  • Experience with large-scale data scraping.
  • Familiarity with AI and machine learning concepts.
  • Knowledge of browser automation and tools.
  • Ability to handle multilingual data.

Job Details

Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work, offering you the opportunity to make an impact globally while working across a global team located across 5 continents. Razer is also a great place to work, providing you the unique, gamer-centric #LifeAtRazer experience that will put you in an accelerated growth, both personally and professionally.

Job Responsibilities/ מ作责务 :

Job Description

Responsibilities:

  • Design, develop, and deploy web scraping solutions to collect specific datasets for AI training purposes.
  • Build robust and scalable web crawlers to extract structured and unstructured data from various online sources.
  • Ensure data accuracy, integrity, and compliance with relevant laws and regulations.
  • Clean, preprocess, and organize scraped data for use in machine learning models.
  • Monitor and optimize crawling performance to ensure efficiency and reliability.
  • Collaborate with AI teams to define data requirements and ensure the relevance of collected data.
  • Document crawling workflows, tools, and results for future reference.

Requirements:

  • Bachelor's or master’s degree in computer science, Software Engineering, or a related field.
  • Strong experience with web scraping tools and frameworks (e.g., Scrapy, Selenium, BeautifulSoup).
  • Proficiency in programming languages like Python, Java, or Node.js.
  • Familiarity with HTTP protocols, HTML parsing, and JSON data formats.
  • Knowledge of database systems (SQL, NoSQL) for data storage and management.
  • Experience with cloud platforms (e.g., AWS, GCP) and containerization tools (e.g., Docker).
  • Strong understanding of web crawling ethics, regulations, and best practices.
  • Excellent analytical skills and attention to detail.

Preferred Qualifications:

  • Experience with large-scale data scraping and handling distributed crawlers.
  • Familiarity with AI and machine learning concepts, especially data preprocessing for AI models.
  • Knowledge of browser automation and tools for rendering dynamic content.
  • Ability to handle multilingual data and diverse data formats.

岗位职责:

  • 设计、开发并部署网页爬虫解决方案,收集特定数据用񎣪I模型训练。
  • 构建稳健且可扩展的爬虫,提取结构化与非结构化数据。
  • 确保数据的准确性、完整性,并符合相关法律法规。
  • 对爬取的数据进行清理、预处理和组织,以便应用于机器学习模型。
  • 监控并优化爬虫性能,确保其高效可靠运行。
  • 񎃪I团队合作,明确数据需求,确保采集数据的相关性和价值。
  • 记录爬虫工作流、工具和结果,以便未来参考和改进。

岗位要求:

  • 计算机科学、软件工程或相关领域的学士或硕士学位。
  • 熟练掌握网页爬取工具与框架(如Scrapy、Selenium�utifulSoup)。
  • 熟悉Python、Java或Node.js等编程语言。
  • 熟悉HTTP协议、HTML解析和JSON数据格式。
  • 了解数据库系统(SQL、NoSQL)用于数据存储与管理。
  • 有云平台(如AWS、GCP)及容器化工具(如Docker)使用经验。
  • 深刻理解爬虫的伦理、法规及最佳实践。
  • 具备优秀的分析能力与细节关注度。

优先条件:

  • 有大规模数据爬取及分布式爬虫经验者优先。
  • 熟悉AI与机器学习概念,尤其是AI模型的数据预处理者优先。
  • 了解浏览器自动化及动态内容渲染工具者优先。
  • 能处理多语言数据及多样化数据格式者优先。

Pre-Requisites/ 任职要求 :

Are you game?

Similar Jobs

Techland - Lead VFX Artist

Techland

Warsaw, Masovian Voivodeship, Poland (On-Site)
5 Months ago
Discord - Staff Software Engineer - Desktop Platform

Discord

San Francisco, California, United States (On-Site)
2 Months ago
Team Liquid - Senior Full Stack Engineer

Team Liquid

Jakarta, Indonesia (Remote)
1 Month ago
Paradox Interactive - Engine Graphics Programmer

Paradox Interactive

Stockholm, Stockholm County, Sweden (On-Site)
1 Month ago
Avalanche Studios Group - Senior Rendering Programmer

Avalanche Studios Group

Stockholm, Stockholm County, Sweden (On-Site)
4 Months ago
Square - Senior Data Architect

Square

Madrid, Community Of Madrid, Spain (On-Site)
3 Weeks ago
Scopely - Senior Data Analyst, Marketing Analytics

Scopely

Mexico City, Mexico (Hybrid)
3 Months ago
level ai - Staff Software Engineer - Data Platform

level ai

Noida, Uttar Pradesh, India (Hybrid)
6 Months ago
Interactive Brokers - Data Engineer

Interactive Brokers

Greenwich, Connecticut, United States (Hybrid)
2 Months ago
Cognite - Senior Data Engineer

Cognite

Bengaluru, Karnataka, India (Hybrid)
11 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Tactic studios - Senior Graphics Programmer

Tactic studios

Canada (Remote)
3 Months ago
Amber - Senior Unreal Game Engineer (Project Based)

Amber

Guadalajara, Jalisco, Mexico (On-Site)
1 Year ago
Ubisoft - Lead R&D Scientist

Ubisoft

Shanghai, Shanghai, China (On-Site)
6 Months ago
Qualcomm - Engineer - Graphics Driver Development

Qualcomm

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Tavus - Support Engineer

Tavus

San Francisco, California, United States (Hybrid)
3 Months ago
Riot Games - Senior Principal Technical Artist

Riot Games

Los Angeles, California, United States (On-Site)
9 Months ago
Universal Music - Senior BI Engineer

Universal Music

Philadelphia, Pennsylvania, United States (On-Site)
9 Months ago
Qloc careers - Technical Artist

Qloc careers

Warsaw, Masovian Voivodeship, Poland (Remote)
1 Week ago
Gearbox - Lighting Artist

Gearbox

Frisco, Texas, United States (On-Site)
8 Months ago
imerza - Senior 3D Production Supervisor

imerza

Sarasota, Florida, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Chengdu, Sichuan, China

Zengame Technology - Game Customer Service

Zengame Technology

Hainan, China (On-Site)
1 Week ago
Haleon - Demand and Supply Planning Manager

Haleon

Shanghai, China (On-Site)
3 Months ago
Paper Stacking games - Ombudsman

Paper Stacking games

Shanghai, China (On-Site)
1 Month ago
Ubisoft - Senior Model Artist [Rainbow Six]

Ubisoft

Chengdu, Sichuan, China (On-Site)
1 Week ago
Mendix - Business Development Representative Intern

Mendix

Shanghai, China (On-Site)
11 Months ago
Moonton  - Senior Scene Concept Artist [MLBB]

Moonton

Shanghai, China (On-Site)
2 Weeks ago
Unity - Senior Client Partner, Supply

Unity

Beijing, China (On-Site)
2 Months ago
Paper Stacking games - Art Tool Development

Paper Stacking games

Shanghai, China (On-Site)
1 Week ago
NVIDIA - Senior Solution Architect - Hardware

NVIDIA

Beijing, Beijing, China (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Data Analysis Jobs

Dentsu - Digital Data Analyst

Dentsu

Madrid, Community Of Madrid, Spain (On-Site)
1 Month ago
Revolgy - Data & AI Cloud Engineer

Revolgy

London, England, United Kingdom (Remote)
2 Months ago
Nagarro - Principal Engineer, Data Science

Nagarro

India (Remote)
9 Months ago
GoTo Group - Senior Data Scientist  (Singapore)

GoTo Group

Singapore (On-Site)
9 Months ago
Casumo - Senior Business Analyst

Casumo

(Hybrid)
5 Months ago
Figma - Data Engineer

Figma

United States (Remote)
1 Week ago
endava - Senior Technical Business Analyst

endava

Ho Chi Minh City, Vietnam (On-Site)
2 Months ago
Toast - Senior Data Analyst Talent Operations

Toast

Chennai, Tamil Nadu, India (Hybrid)
1 Month ago
TransUnion - Sr Analyst, Data Analysis and Consulting

TransUnion

Burlington, Ontario, Canada (Hybrid)
2 Weeks ago
CookUnity - Staff Data Scientist

CookUnity

New York, United States (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

About The Company

At Razer, you'll be at the forefront of the most exciting industry in the world — gaming. Evolving forms of gaming require evolving forms of hardware, software and services. That’s where Razer comes in, offering innovative top-of-the-line products and services to allow gamers to fully immerse in the ultimate gaming experience.Getting onboard Razer will place you on a global mission to bring gamers closer to the games they love. Razer is a place to do great work, offering you the opportunity to be a part of a global team across 11 countries. Whether you are a hardcore evangelist who breathe life to the latest and greatest gaming gear or a behind-the-scene hero who runs our global operations, you are assured of a career-changing quest that transcends time zones and culture with one single spell: For Gamers. By Gamers.The journey towards phenomenal-ness won’t come easy. However, we will excel because gamers rely on teamwork. We achieve greatness because we are wicked problem-solvers and tenacious in clinching victories in all that we do. It is the team that makes Razer where it is today and will continue to bring Razer to even greater heights.

Singapore (On-Site)

San Jose, California, United States (On-Site)

Singapore (On-Site)

Singapore (On-Site)

Singapore (On-Site)

Singapore (On-Site)

View All Jobs

Get notified when new jobs are added by Razer

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug