Senior Data Extraction Engineer

1 Month ago • All levels • Data Analysis

Job Summary

Job Description

The Senior Data Extraction Engineer will join Razer on a global mission to revolutionize gaming. The role involves designing, developing, and deploying web scraping solutions to collect datasets for AI training, building scalable web crawlers, ensuring data accuracy and compliance, cleaning and organizing scraped data, monitoring crawler performance, collaborating with AI teams, and documenting workflows. The responsibilities also include designing, developing and deploying web crawler solutions to collect specific datasets for AI model training and building robust and scalable crawlers to extract structured and unstructured data.
Must have:
  • Bachelor's or master’s in computer science or related field.
  • Experience with web scraping tools and frameworks.
  • Proficiency in programming languages.
  • Familiarity with HTTP protocols and JSON data formats.
  • Knowledge of database systems for data storage.
  • Experience with cloud platforms and containerization tools.
  • Strong understanding of web crawling ethics and regulations.
  • Excellent analytical skills and attention to detail.
Good to have:
  • Experience with large-scale data scraping.
  • Familiarity with AI and machine learning concepts.
  • Knowledge of browser automation and tools.
  • Ability to handle multilingual data.

Job Details

Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work, offering you the opportunity to make an impact globally while working across a global team located across 5 continents. Razer is also a great place to work, providing you the unique, gamer-centric #LifeAtRazer experience that will put you in an accelerated growth, both personally and professionally.

Job Responsibilities/ מ作责务 :

Job Description

Responsibilities:

  • Design, develop, and deploy web scraping solutions to collect specific datasets for AI training purposes.
  • Build robust and scalable web crawlers to extract structured and unstructured data from various online sources.
  • Ensure data accuracy, integrity, and compliance with relevant laws and regulations.
  • Clean, preprocess, and organize scraped data for use in machine learning models.
  • Monitor and optimize crawling performance to ensure efficiency and reliability.
  • Collaborate with AI teams to define data requirements and ensure the relevance of collected data.
  • Document crawling workflows, tools, and results for future reference.

Requirements:

  • Bachelor's or master’s degree in computer science, Software Engineering, or a related field.
  • Strong experience with web scraping tools and frameworks (e.g., Scrapy, Selenium, BeautifulSoup).
  • Proficiency in programming languages like Python, Java, or Node.js.
  • Familiarity with HTTP protocols, HTML parsing, and JSON data formats.
  • Knowledge of database systems (SQL, NoSQL) for data storage and management.
  • Experience with cloud platforms (e.g., AWS, GCP) and containerization tools (e.g., Docker).
  • Strong understanding of web crawling ethics, regulations, and best practices.
  • Excellent analytical skills and attention to detail.

Preferred Qualifications:

  • Experience with large-scale data scraping and handling distributed crawlers.
  • Familiarity with AI and machine learning concepts, especially data preprocessing for AI models.
  • Knowledge of browser automation and tools for rendering dynamic content.
  • Ability to handle multilingual data and diverse data formats.

岗位职责:

  • 设计、开发并部署网页爬虫解决方案,收集特定数据用񎣪I模型训练。
  • 构建稳健且可扩展的爬虫,提取结构化与非结构化数据。
  • 确保数据的准确性、完整性,并符合相关法律法规。
  • 对爬取的数据进行清理、预处理和组织,以便应用于机器学习模型。
  • 监控并优化爬虫性能,确保其高效可靠运行。
  • 񎃪I团队合作,明确数据需求,确保采集数据的相关性和价值。
  • 记录爬虫工作流、工具和结果,以便未来参考和改进。

岗位要求:

  • 计算机科学、软件工程或相关领域的学士或硕士学位。
  • 熟练掌握网页爬取工具与框架(如Scrapy、Selenium�utifulSoup)。
  • 熟悉Python、Java或Node.js等编程语言。
  • 熟悉HTTP协议、HTML解析和JSON数据格式。
  • 了解数据库系统(SQL、NoSQL)用于数据存储与管理。
  • 有云平台(如AWS、GCP)及容器化工具(如Docker)使用经验。
  • 深刻理解爬虫的伦理、法规及最佳实践。
  • 具备优秀的分析能力与细节关注度。

优先条件:

  • 有大规模数据爬取及分布式爬虫经验者优先。
  • 熟悉AI与机器学习概念,尤其是AI模型的数据预处理者优先。
  • 了解浏览器自动化及动态内容渲染工具者优先。
  • 能处理多语言数据及多样化数据格式者优先。

Pre-Requisites/ 任职要求 :

Are you game?

Similar Jobs

Figma - Software Engineer, Rendering & Animation

Figma

San Francisco, California, United States (Remote)
2 Weeks ago
playground - Lighting Artist

playground

Royal Leamington Spa, England, United Kingdom (Hybrid)
2 Months ago
Electronic Arts - Senior Environment Artist, External Development

Electronic Arts

Montreal, Quebec, Canada (Hybrid)
1 Month ago
SideFX - 3D Software Developer

SideFX

Toronto, Ontario, Canada (Hybrid)
5 Months ago
HoYoverse - Senior Business Development Manager

HoYoverse

Québec City, Quebec, Canada (Remote)
3 Months ago
Unity - Staff Data Scientist

Unity

Montreal, Quebec, Canada (On-Site)
2 Months ago
Capgemini - Data Analyst (Consultant)

Capgemini

Pune, Maharashtra, India (On-Site)
1 Month ago
PayPal - Fraud Science Data Engineer

PayPal

Scottsdale, Arizona, United States (Hybrid)
1 Month ago
zeta - Principal Data Scientist II

zeta

Bengaluru, Karnataka, India (On-Site)
3 Weeks ago
Mindstorm studios - Data Analyst

Mindstorm studios

Lahore, Punjab, Pakistan (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Kabam - Senior VFX Artist

Kabam

Vancouver, British Columbia, Canada (On-Site)
2 Months ago
flix interactive - VFX Artist

flix interactive

Birmingham, England, United Kingdom (Remote)
1 Month ago
playrix  - Senior/Lead 2D Artist (Generalist)

playrix

Ukraine (Remote)
8 Months ago
Milk  visual effects - Environment TD

Milk visual effects

(Remote)
7 Months ago
Hero Gaming - Senior Frontend Developer

Hero Gaming

Marbella, Andalusia, Spain (Hybrid)
9 Months ago
playrix  - Principal 2D Artist

playrix

Portugal (Remote)
8 Months ago
tecHouse Games - CG Artist (Post-production)

tecHouse Games

Lahore, Punjab, Pakistan (On-Site)
3 Years ago
Xentrix studios - Compositing – Senior Artist

Xentrix studios

India (On-Site)
7 Months ago
bytedance - Frontend Software Engineer - Customer Service Platforms - Seattle

bytedance

Seattle, Washington, United States (On-Site)
8 Months ago
Riot Games - Senior Technical Artist (Rendering) - VALORANT, UI/UX

Riot Games

United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Chengdu, Sichuan, China

Rolls-Royce - Programme Manager

Rolls-Royce

Shanghai, China (On-Site)
1 Month ago
Buckman - 3D Special Effects

Buckman

Shanghai, China (On-Site)
4 Weeks ago
Cadence - Leader software engineer

Cadence

Shanghai, China (On-Site)
3 Weeks ago
Paper Stacking games - Japanese Brand Event Planner - Love and Deepspace

Paper Stacking games

Shanghai, China (On-Site)
1 Month ago
Riot Games - Research Operation Coordinator

Riot Games

Shanghai, China (On-Site)
1 Year ago
Paper Stacking games - Ombudsman

Paper Stacking games

Shanghai, China (On-Site)
3 Weeks ago
Finger Tango - Game Product Operations (Data Analysis Focused)

Finger Tango

Guangzhou, Guangdong Province, China (On-Site)
1 Year ago
Tencent - External Cooperation PM

Tencent

Shenzhen, Guangdong Province, China (On-Site)
1 Month ago
Thatgamecompany - Marketing Manager - Offline Events - China

Thatgamecompany

Shanghai, Shanghai, China (On-Site)
3 Months ago
Zeeco, Inc. - Equipment Engineer

Zeeco, Inc.

Shanghai, China (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Data Analysis Jobs

Games talent (Staffing and recruiting) - Senior Data Engineer

Games talent (Staffing and recruiting)

(Remote)
2 Months ago
Workato - Senior Java Engineer (CDC and Data Integration)

Workato

Sofia, Sofia City Province, Bulgaria (On-Site)
1 Month ago
LeoVegas - Data Engineer - Sportsbook

LeoVegas

Stockholm, Stockholm County, Sweden (Hybrid)
1 Year ago
YouGov - VP Analytics, Data Products

YouGov

United States (Remote)
3 Weeks ago
Dentsu - Data & Analytics Strategy Specialist

Dentsu

Taipei City, Taiwan (On-Site)
3 Weeks ago
luxsoft - Business analyst

luxsoft

Irvine, California, United States (On-Site)
10 Months ago
Apple - Senior Data Scientist - Business Analytics

Apple

Austin, Texas, United States (On-Site)
1 Month ago
ShyftLabs - Principal Data Scientist

ShyftLabs

Toronto, Ontario, Canada (Hybrid)
1 Week ago
Growe - Data Analyst

Growe

Colombia (On-Site)
1 Week ago
Wildlife Studios - Data Scientist

Wildlife Studios

São Paulo, Brazil (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

At Razer, you'll be at the forefront of the most exciting industry in the world — gaming. Evolving forms of gaming require evolving forms of hardware, software and services. That’s where Razer comes in, offering innovative top-of-the-line products and services to allow gamers to fully immerse in the ultimate gaming experience.Getting onboard Razer will place you on a global mission to bring gamers closer to the games they love. Razer is a place to do great work, offering you the opportunity to be a part of a global team across 11 countries. Whether you are a hardcore evangelist who breathe life to the latest and greatest gaming gear or a behind-the-scene hero who runs our global operations, you are assured of a career-changing quest that transcends time zones and culture with one single spell: For Gamers. By Gamers.The journey towards phenomenal-ness won’t come easy. However, we will excel because gamers rely on teamwork. We achieve greatness because we are wicked problem-solvers and tenacious in clinching victories in all that we do. It is the team that makes Razer where it is today and will continue to bring Razer to even greater heights.

Orlando, Florida, United States (On-Site)

Shah Alam, Selangor, Malaysia (On-Site)

Shah Alam, Selangor, Malaysia (On-Site)

Shah Alam, Selangor, Malaysia (On-Site)

State Of São Paulo, Brazil (On-Site)

Paramus, New Jersey, United States (On-Site)

Paramus, New Jersey, United States (On-Site)

Irvine, California, United States (On-Site)

Shah Alam, Selangor, Malaysia (On-Site)

View All Jobs

Get notified when new jobs are added by Razer

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug