Skip to content

Comments

feat: update readme and profiling req#241

Merged
xieofxie merged 2 commits intomainfrom
hualxie/minor
Feb 12, 2026
Merged

feat: update readme and profiling req#241
xieofxie merged 2 commits intomainfrom
hualxie/minor

Conversation

@xieofxie
Copy link
Contributor

No description provided.

@xieofxie xieofxie requested review from a team as code owners February 12, 2026 06:30
Copilot AI review requested due to automatic review settings February 12, 2026 06:30
ssss141414
ssss141414 previously approved these changes Feb 12, 2026
@xieofxie xieofxie merged commit fbff059 into main Feb 12, 2026
7 checks passed
@xieofxie xieofxie deleted the hualxie/minor branch February 12, 2026 06:34
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Updates model-specific AITK documentation and profiling dependencies to reflect newer quantization/runtime options, and adjusts repo copy/check metadata accordingly.

Changes:

  • Refresh several model READMEs: rename AMD NPU workflow to “Quark Quantization”, add “int4 Quantization for QNN GPU”, and remove the AutoAWQ mention in the DML workflow.
  • Add onnxruntime-genai-winml==0.11.2 to the profiling requirements set.
  • Remove a README copy step from meta-llama-Llama-3.1-8B-Instruct’s _copy.json.config and decrement copyCheck accordingly.

Reviewed changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 8 comments.

Show a summary per file
File Description
microsoft-Phi-3.5-mini-instruct/aitk/README.md Updates workflow list and prerequisites text.
meta-llama-Llama-3.2-1B-Instruct/aitk/README.md Updates workflow list and prerequisites text.
deepseek-ai-DeepSeek-R1-Distill-Qwen-1.5B/aitk/README.md Updates workflow list and prerequisites text.
Qwen-Qwen2.5-1.5B-Instruct/aitk/README.md Updates workflow list and prerequisites text.
meta-llama-Llama-3.1-8B-Instruct/aitk/_copy.json.config Stops copying README from another model template.
.aitk/requirements/requirements-Profiling.txt Adds onnxruntime-genai-winml to profiling environment.
.aitk/configs/checks.json Updates expected copy check count.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment on lines +5 to +8
- Quark Quantization for AMD NPU
- PTQ + AOT for QNN NPU
+ This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**
- Int4 Quantization for QNN GPU
Copy link

Copilot AI Feb 12, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The workflow list now mentions “Quark Quantization for AMD NPU” and “int4 Quantization for QNN GPU”, but this README doesn’t include any corresponding sections/usage guidance (and there’s no other mention of Quark/QNN GPU later). Either add links/sections that explain how to run these workflows (e.g., which *.json.config to execute), or remove the bullets to avoid advertising unsupported steps.

Suggested change
- Quark Quantization for AMD NPU
- PTQ + AOT for QNN NPU
+ This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**
- Int4 Quantization for QNN GPU
- PTQ + AOT for QNN NPU
+ This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**

Copilot uses AI. Check for mistakes.
+ This process uses AutoAWQ and ModelBuilder
+ This process uses ModelBuilder

**For some python packages, users need to install visual studio 2022 or visual studio 2022 build tools with c++ development tools modules.**
Copy link

Copilot AI Feb 12, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Capitalize product/term names in this prerequisite sentence for readability/accuracy (Python, Visual Studio 2022, Build Tools, C++). Also consider using the official Visual Studio wording (“C++ development workload/tools”) rather than “modules”.

Suggested change
**For some python packages, users need to install visual studio 2022 or visual studio 2022 build tools with c++ development tools modules.**
**For some Python packages, users need to install Visual Studio 2022 or Visual Studio 2022 Build Tools with the C++ development workload (or C++ build tools) installed.**

Copilot uses AI. Check for mistakes.
Comment on lines +5 to +8
- Quark Quantization for AMD NPU
- PTQ + AOT for QNN NPU
+ This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**
- Int4 Quantization for QNN GPU
Copy link

Copilot AI Feb 12, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The workflow list now mentions “Quark Quantization for AMD NPU” and “int4 Quantization for QNN GPU”, but this README doesn’t include any corresponding sections/usage guidance (and there’s no other mention of Quark/QNN GPU later). Either add links/sections that explain how to run these workflows (e.g., which *.json.config to execute), or remove the bullets to avoid advertising unsupported steps.

Suggested change
- Quark Quantization for AMD NPU
- PTQ + AOT for QNN NPU
+ This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**
- Int4 Quantization for QNN GPU
- PTQ + AOT for QNN NPU
+ This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**

Copilot uses AI. Check for mistakes.
+ This process uses AutoAWQ and ModelBuilder
+ This process uses ModelBuilder

**For some python packages, users need to install visual studio 2022 or visual studio 2022 build tools with c++ development tools modules.**
Copy link

Copilot AI Feb 12, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Capitalize product/term names in this prerequisite sentence for readability/accuracy (Python, Visual Studio 2022, Build Tools, C++). Also consider using the official Visual Studio wording (“C++ development workload/tools”) rather than “modules”.

Suggested change
**For some python packages, users need to install visual studio 2022 or visual studio 2022 build tools with c++ development tools modules.**
**For some Python packages, users need to install Visual Studio 2022 or Visual Studio 2022 Build Tools with the C++ development tools workload.**

Copilot uses AI. Check for mistakes.
Comment on lines +5 to +8
- Quark Quantization for AMD NPU
- PTQ + AOT for QNN NPU
+ This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**
- Int4 Quantization for QNN GPU
Copy link

Copilot AI Feb 12, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The workflow list now mentions “Quark Quantization for AMD NPU” and “int4 Quantization for QNN GPU”, but this README doesn’t include any corresponding sections/usage guidance (and there’s no other mention of Quark/QNN GPU later). Either add links/sections that explain how to run these workflows (e.g., which *.json.config to execute), or remove the bullets to avoid advertising unsupported steps.

Suggested change
- Quark Quantization for AMD NPU
- PTQ + AOT for QNN NPU
+ This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**
- Int4 Quantization for QNN GPU
- PTQ + AOT for QNN NPU
+ This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**

Copilot uses AI. Check for mistakes.
+ This process uses AutoAWQ and ModelBuilder
+ This process uses ModelBuilder

**For some python packages, users need to install visual studio 2022 or visual studio 2022 build tools with c++ development tools modules.**
Copy link

Copilot AI Feb 12, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Capitalize product/term names in this prerequisite sentence for readability/accuracy (Python, Visual Studio 2022, Build Tools, C++). Also consider using the official Visual Studio wording (“C++ development workload/tools”) rather than “modules”.

Suggested change
**For some python packages, users need to install visual studio 2022 or visual studio 2022 build tools with c++ development tools modules.**
**For some Python packages, users need to install Visual Studio 2022 with the C++ development workload or Visual Studio 2022 Build Tools with the C++ build tools.**

Copilot uses AI. Check for mistakes.
Comment on lines +5 to +8
- Quark Quantization for AMD NPU
- PTQ + AOT for QNN NPU
+ This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**
- Int4 Quantization for QNN GPU
Copy link

Copilot AI Feb 12, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The workflow list now mentions “Quark Quantization for AMD NPU” and “int4 Quantization for QNN GPU”, but this README doesn’t include any corresponding sections/usage guidance (and there’s no other mention of Quark/QNN GPU later). Either add links/sections that explain how to run these workflows (e.g., which *.json.config to execute), or remove the bullets to avoid advertising unsupported steps.

Suggested change
- Quark Quantization for AMD NPU
- PTQ + AOT for QNN NPU
+ This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**
- Int4 Quantization for QNN GPU
- PTQ + AOT for QNN NPU
+ This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**

Copilot uses AI. Check for mistakes.
+ This process uses AutoAWQ and ModelBuilder
+ This process uses ModelBuilder

**For some python packages, users need to install visual studio 2022 or visual studio 2022 build tools with c++ development tools modules.**
Copy link

Copilot AI Feb 12, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Capitalize product/term names in this prerequisite sentence for readability/accuracy (Python, Visual Studio 2022, Build Tools, C++). Also consider using the official Visual Studio wording (“C++ development workload/tools”) rather than “modules”.

Suggested change
**For some python packages, users need to install visual studio 2022 or visual studio 2022 build tools with c++ development tools modules.**
**For some Python packages, users need to install Visual Studio 2022 with the C++ development workload, or Visual Studio 2022 Build Tools with the C++ build tools installed.**

Copilot uses AI. Check for mistakes.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants