Projects

Projects I've built or contributed to. Give them a star if you like them!

  • Qwen3-INT8

    This project converts the Qwen3 model to INT8 format to speed up inference and significantly reduce GPU memory consumption, making the model more efficient to deploy.
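
To give a feel for what an INT8 conversion involves, here is a minimal sketch of symmetric per-tensor quantization in plain Python. It is an illustration only, not the method this project uses: the function names `quantize_int8` and `dequantize_int8` and the sample weights are hypothetical. The memory saving comes from storing each weight in 1 byte instead of the 2 bytes needed for FP16.

```python
def quantize_int8(weights):
    """Symmetric per-tensor quantization: floats -> (int8-range ints, scale).

    Hypothetical helper for illustration; not taken from the project.
    """
    max_abs = max((abs(w) for w in weights), default=0.0)
    # Map the largest magnitude to 127; fall back to 1.0 for all-zero input.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer and clamp to the symmetric range [-127, 127].
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize_int8(q, scale):
    """Recover approximate float values from the quantized integers."""
    return [v * scale for v in q]


# Sample weights are made up for the example.
weights = [0.4, -1.0, 0.25, 0.0]
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)

for w, v, r in zip(weights, q, restored):
    print(f"original={w:+.4f}  int8={v:+4d}  restored={r:+.4f}")
```

Each restored value differs from the original by at most half of `scale`, which is the rounding error this scheme trades for the smaller storage size. Real conversions usually quantize per channel or per group and may calibrate on sample data, so treat this as a starting point rather than a recipe.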

©2025 Shawn.