当前位置:首页>职位列表>职位详情
Research Scientist 100000-150000元
香港香港 3年以上 博士
  • 补充医疗保险
  • 创业公司
  • 强积金
Video Rebirth Limited 2025-02-20 10:06:33 983人关注
职位描述
Position Overview We are seeking an experienced AI Research Scientist to lead foundation model development initiatives. The ideal candidate will have hands-on experience in training large-scale models at major tech companies and a proven track record in advancing the state-of-the-art in foundation models. Key Responsibilities Lead the architecture design and training of large-scale foundation models Develop and optimize model training pipelines for distributed systems Drive research initiatives in model scaling, efficiency, and performance Implement innovative approaches to improve model capabilities and training efficiency Collaborate with the engineering team to productionize research breakthroughs Guide technical decisions related to model architecture and training strategies Required Qualifications Ph.D. in Computer Science, Machine Learning, or related field 3+ years of experience in training large-scale models at major tech companies, including: International tech leaders (e.g., Google, Meta, Microsoft, OpenAI, Anthropic) OR Leading Chinese tech companies (e.g., ByteDance, Alibaba, Baidu, Tencent, SenseTime, Huawei) Proven experience with distributed training systems and large-scale model optimization Deep understanding of transformer architectures and their variants Strong track record in developing and training foundation models Extensive experience with PyTorch and/or JAX Publication record in top-tier conferences (NeurIPS, ICML, ICLR) Preferred Qualifications Experience with both Chinese and international AI ecosystems Familiarity with Chinese AI infrastructure (e.g., ModelArts, PAI, ByteMLab) Background in scaling laws and efficient training strategies Experience with video generation models or multimodal architectures Track record of open-source contributions to major ML frameworks Experience with ML infrastructure design and implementation Familiarity with mixed-precision training and model parallelism Experience with custom CUDA kernels and optimization Technical Expertise Large-Scale Training: Distributed training frameworks, model parallelism strategies Infrastructure: International cloud platforms (AWS/GCP) Chinese cloud platforms (Alibaba Cloud, Tencent Cloud, Huawei Cloud) Languages: Python, CUDA, C++ (optional) Frameworks: Standard: PyTorch, JAX, DeepSpeed, Megatron-LM Chinese ecosystem: PaddlePaddle, MindSpore (plus) Development Tools: Git, Docker, Kubernetes Monitoring: Weights & Biases, MLflow, or similar tools What We Offer Opportunity to shape the future of foundation models in video generation Leadership role in technical decision-making Access to substantial computing resources and infrastructure Competitive compensation package including equity Regular collaboration with top researchers in the field Support for conference attendance and research publication International exposure and collaboration opportunities Location Hong Kong (on-site, Hong Kong Science and Technology Park) Expected Impact Drive the development of next-generation foundation models Lead research initiatives that push the boundaries of model capabilities Build and mentor a world-class research team
联系方式
注:联系我时,请说是在今日招聘网上看到的。
工作地点
地址:香港香港沙田区香港科学园10W栋317-318
以担保或任何理由索取财物,扣押证照,均涉嫌违法,请提高警惕

若您已有简历,可直接登录登录

  • 省份

    注:0表示面议
    获取验证码
    保存并投递
    投递简历
      马上投递
      投递简历
        马上投递

        企业
        服务热线

        • 400-6680-889
        1. 登录
        2. 注册
        客户服务热线:
        400-6680-889
        在线客服:
        点击这里给我发消息 898995850
        工作日:
        8:30-18:00