职位描述
该职位还未进行加V认证,请仔细了解后再进行投递!
Position Overview
We are seeking a skilled Data Engineer to design, build, and optimize our data pipelines and infrastructure. The ideal candidate will have strong experience in handling large-scale video datasets and building efficient data processing systems for machine learning applications.
Key Responsibilities
Design and implement scalable data pipelines for processing, storing, and managing large-scale video datasets
Build and maintain data infrastructure for training data preparation and feature engineering
Develop efficient ETL processes for various data sources including videos, images, and metadata
Create and optimize data storage solutions for high-performance data access
Implement data quality monitoring and validation systems
Collaborate with ML researchers to support model training and evaluation needs
Ensure data security and compliance across all data operations
Required Qualifications
Master's degree in Computer Science, Software Engineering, or related field
8+ years of experience in data engineering roles at tech companies
Strong programming skills in Python, Java, SQL and Shell
Experience with big data technologies (Spark, Hadoop ecosystem)
Proven track record in building and maintaining data pipelines
Experience with cloud platforms (AWS/GCP/Azure or Alibaba Cloud/Tencent Cloud)
Strong understanding of data modeling and database design
Preferred Qualifications
Experience with video processing and storage systems
Knowledge of ML/AI data pipeline requirements
Familiarity with distributed computing systems and high-performance computing
Experience with streaming data processing
Understanding of data privacy and security best practices
Experience with Cloud services and data infrastructure
Bilingual proficiency (English/Chinese)
Technical Skills
Data Processing & Storage
Databases: PostgreSQL, MongoDB, Redis
Big Data: Spark, Hadoop, Hive
Data Warehousing: Snowflake, Amazon Redshift
Stream Processing: Kafka, Apache Flink
Cloud & Infrastructure
Cloud Platforms: AWS/GCP/Azure, Alibaba Cloud/Tencent Cloud
Container Orchestration: Docker, Kubernetes
Infrastructure as Code: Terraform, Ansible
Programming & Tools
Languages: Python, SQL, Shell scripting
ETL Tools: Airflow, Luigi
Version Control: Git
Monitoring: Prometheus, Grafana
Video Processing
FFmpeg, OpenCV
Video compression and optimization techniques
Video metadata extraction and management
What We Offer
Opportunity to build critical infrastructure for cutting-edge AI technology
Competitive salary and equity package
Flexible work arrangements
Professional development opportunities
Modern tech stack and tools
Collaborative and innovative work environment
Health and wellness benefits
Location
Hong Kong (on-site, Hong Kong Science and Technology Park)
Expected Impact
Shape the foundation of our data infrastructure
Influence architectural decisions
Build and mentor a world-class technology team
工作地点
地址:香港香港香港沙田区香港科学园10W栋317-318
![](http://img.jrzp.com/jrzpfile/cityrcw/SearchJob/images/jg.png)
![](https://img.jrzp.com/images_server/comm/nan1923.png)
职位发布者
张先生HR
Video Rebirth Limited
![](http://img.jrzp.com/jrzpfile/cityrcw/images/sfrz_yrz.png)
-
计算机软件
-
11-20人
-
外商独资·外企办事处
-
香港科学园10W栋317-318