sneaxiy

sneaxiy

43 followers · 4 following

Achievements

x4 x2

Achievements

x4 x2

Stars

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 28,950 6,507 Updated Jun 13, 2026

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …

Python 14,491 1,475 Updated Jun 13, 2026

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 9,725 1,283 Updated Jun 11, 2026

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,704 1,058 Updated Apr 30, 2026

NVIDIA / nccl

Optimized primitives for collective multi-GPU communication

C++ 4,808 1,295 Updated Jun 13, 2026

mlcommons / training_results_v2.0

This repository contains the results and code for the MLPerf™ Training v2.0 benchmark.

C++ 29 24 Updated Feb 23, 2024

sql-machine-learning / sqlflow

Brings SQL and AI together.

Go 5,182 705 Updated Apr 18, 2024

Tencent / TurboTransformers

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

C++ 1,546 206 Updated Jul 18, 2025

Jittor / jittor

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.

Python 3,226 321 Updated Jun 3, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sneaxiy

Achievements

Achievements

Block or report sneaxiy

Stars

sgl-project / sglang

modelscope / ms-swift

deepseek-ai / DeepEP

deepseek-ai / FlashMLA

NVIDIA / nccl

mlcommons / training_results_v2.0

sql-machine-learning / sqlflow

Tencent / TurboTransformers

Jittor / jittor