Skip to content
View nicolaus-huang's full-sized avatar
🌏
🌏

Block or report nicolaus-huang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
nicolaus-huang/README.md

Hi there πŸ‘‹, I'm Nicolaus Huang (黄施捷)

Algorithm Engineer @ Alibaba | AI Researcher | Open Source Contributor

I'm passionate about building cutting-edge AI systems that serve real users at scale. Currently working on the Qwen app at Alibaba, where I focus on multimodal AI, diffusion models, and large-scale data engineering.


πŸ”¬ What I Do

  • Multimodal AI Systems: Building instant cross-modal retrieval engines over 6B+ images
  • Diffusion Models: Contributing to image and video generation foundation models
  • AI System Engineering: Model post-training, quantization, deployment, and optimization
  • Large-Scale Data Processing: Data pipeline engineering, filtering, and re-balancing

πŸš€ Featured Projects

Real-time streaming avatar generation with infinite length. Achieving 20 FPS with a 14B-parameter diffusion model through innovative pipeline parallelism.

An efficient image generation foundation model with single-stream diffusion transformer architecture.

Training commercial-level video generation models. Contributed to data processing pipeline and inference engine optimization.

Learning to draw from sequence data. Presented at SIGGRAPH Asia 2024.

Learning customized instructional image editor from few-shot examples. Published at ICCV 2025.

A mathematics encyclopedia built by undergraduate students, embracing the Agent era for mathematical education.


πŸ”₯ I'm Recruiting Research Interns!

I have multiple exciting opportunities for passionate researchers:

  • Digital Avatar Interns: Building the future of personal AI assistants with Live Avatar
  • Multimodal Data Diagnostic Tool: Creating semantic retrieval systems for 100B+ data points
  • [Open Source] Easymath-wiki Contributors: Transforming mathematical education with AI agents, not interns but collaborators

Interested? Check out my website for details or reach out at zhengjie.hsj@alibaba-inc.com

πŸ’‘ Fun Fact: When I'm not coding or researching, I enjoy strategic games, writing, and reading!

Pinned Loading

  1. showlab/PhotoDoodle showlab/PhotoDoodle Public

    [ICCV 2025] Code Implementation of "ArtEditor: Learning Customized Instructional Image Editor from Few-Shot Examples"

    Python 432 28

  2. ProcessPainter ProcessPainter Public archive

    [SIGGRAPH Asia 2024] Painting process generating using diffusion models

    Python 94 5

  3. hpcaitech/Open-Sora hpcaitech/Open-Sora Public

    Open-Sora: Democratizing Efficient Video Production for All

    Python 28.7k 2.9k

  4. Easymath-wiki/Easymath-wiki Easymath-wiki/Easymath-wiki Public

    Know the things about math where you want to know immediately.

    HTML 123 12