test: Align QA BF16 with ml_dtypes and generate ONNX BF16 models#8782

Merged: yinggeh merged 1 commit into main from yinggeh/tri-801-deprecate-bf16-to-fp32-conversion-in-python-client-library on May 18, 2026

@yinggeh yinggeh commented May 15, 2026

Contributor

What does the PR do?

Aligns the server QA tests with the Python client's BF16 change: BF16 inputs and outputs now use ml_dtypes.bfloat16 instead of truncated float32 values.

Extends the QA model factory so ONNX BF16 models are generated.

L0_backend_onnxruntime now copies the generated onnx_bf16_bf16_bf16 model from the QA data directory and exercises add/sub with batched BF16 inputs. The standalone gen_add_bf16_onnx_model.py generator is removed.
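
For readers unfamiliar with the "float32 truncation" mentioned above, here is a minimal standard-library sketch of the idea: a BF16 value is the upper 16 bits of the float32 encoding, so truncation simply drops the lower 16 bits. The function names are hypothetical and this is not the Triton client's code. It only illustrates the bit-level relationship that a native 16-bit dtype such as ml_dtypes.bfloat16 makes unnecessary to handle by hand.

```python
import struct


def float32_to_bf16_bits_truncate(value: float) -> int:
    """Return the 16-bit BF16 pattern obtained by dropping the
    low 16 bits of the IEEE 754 float32 encoding of `value`."""
    (bits32,) = struct.unpack("<I", struct.pack("<f", value))
    return bits32 >> 16


def bf16_bits_to_float(bits16: int) -> float:
    """Expand a 16-bit BF16 pattern back to a Python float by
    zero-filling the low 16 bits of a float32."""
    (value,) = struct.unpack("<f", struct.pack("<I", (bits16 & 0xFFFF) << 16))
    return value


if __name__ == "__main__":
    # Values exactly representable in BF16 survive the round trip;
    # others lose their low mantissa bits.
    for x in (1.0, -2.5, 3.14159):
        bits = float32_to_bf16_bits_truncate(x)
        print(f"{x!r:>9} -> 0x{bits:04X} -> {bf16_bits_to_float(bits)!r}")
```

In this sketch, 1.0 maps to the pattern 0x3F80 and round-trips exactly, while 3.14159 comes back as 3.140625 because only 7 explicit mantissa bits are kept.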

Checklist

  • PR title reflects the change and is of format <commit_type>: <Title>
  • Changes are described in the pull request.
  • Related issues are referenced.
  • Populated github labels field
  • Added test plan and verified test passes.
  • Verified that the PR passes existing CI.
  • Verified copyright is correct on all changed files.
  • Added succinct git squash message before merging ref.
  • All template sections are filled out.
  • Optional: Additional screenshots for behavior/output changes with before/after.

Commit Type:

Check the conventional commit type box here and add the label to the github PR.

  • test

Related PRs:

triton-inference-server/client#897

Where should the reviewer start?

Test plan:

  • CI Pipeline ID: 51358164

Caveats:

Background

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

@yinggeh yinggeh changed the title from "test: Align QA BF16 with ml_dtypes and generate ONNX BF16 models (TRI-801)" to "test: Align QA BF16 with ml_dtypes and generate ONNX BF16 models" May 15, 2026
@yinggeh yinggeh requested review from mudit-eng, pskiran1 and whoisj May 15, 2026 03:13
@yinggeh yinggeh self-assigned this May 15, 2026
@yinggeh yinggeh added the "PR: test" label (Adding missing tests or correcting existing test) May 15, 2026
@yinggeh yinggeh merged commit 665030a into main May 18, 2026
3 checks passed
@yinggeh yinggeh deleted the yinggeh/tri-801-deprecate-bf16-to-fp32-conversion-in-python-client-library branch May 18, 2026 22:46