Releases · microsoft/onnxruntime · GitHub
August 12, 2025
What's new?
This release adds an optimized CPU/MLAS implementation of DequantizeLinear (8-bit) and introduces the build option client_package_build, which enables defaults better suited to client/on-device workloads (e.g., thread spinning is disabled by default).

Build System & Packages
Add --client_package_build option (#25351) - @jywu-msft
Remove the python installation steps from win-qnn-arm64-ci-pipeline.yml (#25552) - @snnn

CPU EP
Add multithreaded/vectorized implementation o
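For readers unfamiliar with the operator named in these notes: ONNX defines DequantizeLinear as y = (x - zero_point) * scale. The following is a minimal pure-Python sketch of the per-tensor 8-bit case, meant only to illustrate what the operator computes. It is not the MLAS kernel from this release, and the function name `dequantize_linear` is a hypothetical helper chosen for this example.

```python
def dequantize_linear(x, scale, zero_point=0):
    """Reference per-tensor DequantizeLinear: y = (x - zero_point) * scale.

    x          -- iterable of 8-bit quantized integers (uint8 or int8 values)
    scale      -- float scale factor shared by the whole tensor
    zero_point -- integer that maps to 0.0 after dequantization

    Hypothetical helper for illustration only; the optimized implementation
    in the release is multithreaded and vectorized, which this is not.
    """
    return [(int(q) - zero_point) * scale for q in x]


# uint8 example: the zero point 128 maps to 0.0
quantized = [0, 128, 255]
result = dequantize_linear(quantized, scale=0.5, zero_point=128)
print(result)  # [-64.0, 0.0, 63.5]
```

Each output element depends only on its own input element, which is what makes the operator a natural fit for splitting across threads and SIMD lanes.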