feat: update readme and profiling req by xieofxie · Pull Request #241 · microsoft/olive-recipes

xieofxie · 2026-02-12T06:30:50Z

No description provided.

Copilot

Pull request overview

Updates model-specific AITK documentation and profiling dependencies to reflect newer quantization/runtime options, and adjusts repo copy/check metadata accordingly.

Changes:

- Refresh several model READMEs: rename the AMD NPU workflow to “Quark Quantization”, add “Int4 Quantization for QNN GPU”, and remove the AutoAWQ mention in the DML workflow.
- Add `onnxruntime-genai-winml==0.11.2` to the profiling requirements set.
- Remove a README copy step from meta-llama-Llama-3.1-8B-Instruct’s `_copy.json.config` and decrement `copyCheck` accordingly.
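
As a rough illustration of the requirements change, the following sketch checks that a requirements file pins a package to an exact version. The helper name `pinned_version` and the sample text are assumptions made for this example; they are not part of the repository, and the real `requirements-Profiling.txt` contains entries not shown here.

```python
import re
from typing import Optional


def pinned_version(requirements_text: str, package: str) -> Optional[str]:
    """Return the version pinned with '==' for `package`, or None if absent or unpinned."""
    # Hypothetical helper: matches lines such as "name==1.2.3" at the start of a line.
    pattern = re.compile(
        rf"^\s*{re.escape(package)}\s*==\s*([^\s#;]+)",
        re.IGNORECASE | re.MULTILINE,
    )
    match = pattern.search(requirements_text)
    return match.group(1) if match else None


# Illustrative content only; the real file has other entries.
sample = """\
# profiling requirements (sample)
onnxruntime-genai-winml==0.11.2
"""

print(pinned_version(sample, "onnxruntime-genai-winml"))
```

Because the pattern requires `==` directly after the package name, a shorter name such as `onnxruntime-genai` does not accidentally match the `-winml` entry.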

Reviewed changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 8 comments.

| File | Description |
| --- | --- |
| microsoft-Phi-3.5-mini-instruct/aitk/README.md | Updates workflow list and prerequisites text. |
| meta-llama-Llama-3.2-1B-Instruct/aitk/README.md | Updates workflow list and prerequisites text. |
| deepseek-ai-DeepSeek-R1-Distill-Qwen-1.5B/aitk/README.md | Updates workflow list and prerequisites text. |
| Qwen-Qwen2.5-1.5B-Instruct/aitk/README.md | Updates workflow list and prerequisites text. |
| meta-llama-Llama-3.1-8B-Instruct/aitk/_copy.json.config | Stops copying README from another model template. |
| .aitk/requirements/requirements-Profiling.txt | Adds `onnxruntime-genai-winml` to profiling environment. |
| .aitk/configs/checks.json | Updates expected copy check count. |

Copilot · 2026-02-12T06:35:09Z

Qwen-Qwen2.5-1.5B-Instruct/aitk/README.md

+- Quark Quantization for AMD NPU
 - PTQ + AOT for QNN NPU
   + This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**
+- Int4 Quantization for QNN GPU


The workflow list now mentions “Quark Quantization for AMD NPU” and “Int4 Quantization for QNN GPU”, but this README doesn’t include any corresponding sections or usage guidance (and there’s no other mention of Quark or QNN GPU later). Either add links or sections that explain how to run these workflows (e.g., which *.json.config to execute), or remove the bullets to avoid advertising unsupported steps.

Suggested change

-- Quark Quantization for AMD NPU
 - PTQ + AOT for QNN NPU
   + This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**
-- Int4 Quantization for QNN GPU
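
The concern raised in this comment (workflow bullets with no matching guidance later in the README) could be checked mechanically. The sketch below is only an illustration: the function name `unexplained_workflows`, the choice of key terms, and the sample README text (including its `##` section heading) are all assumptions, not content from the repository.

```python
import re
from typing import List


def unexplained_workflows(readme_text: str, terms: List[str]) -> List[str]:
    """Return the terms that occur at most once in the README (case-insensitive).

    A term that appears only once is assumed to be mentioned in the workflow
    list but never explained elsewhere in the document.
    """
    missing = []
    for term in terms:
        hits = re.findall(re.escape(term), readme_text, flags=re.IGNORECASE)
        if len(hits) <= 1:
            missing.append(term)
    return missing


# Hypothetical README excerpt; the section heading below is invented for the example.
sample_readme = """\
- Quark Quantization for AMD NPU
- PTQ + AOT for QNN NPU
- Int4 Quantization for QNN GPU

## PTQ + AOT for QNN NPU
Run the corresponding config to produce the model.
"""

print(unexplained_workflows(sample_readme, ["Quark", "QNN NPU", "QNN GPU"]))
```

Counting occurrences is a deliberately crude heuristic; a stricter check might look for a heading or a link per workflow instead.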

Copilot · 2026-02-12T06:35:10Z

Qwen-Qwen2.5-1.5B-Instruct/aitk/README.md

-   + This process uses AutoAWQ and ModelBuilder
+   + This process uses ModelBuilder
+
+**For some python packages, users need to install visual studio 2022 or visual studio 2022 build tools with c++ development tools modules.**


Capitalize product/term names in this prerequisite sentence for readability/accuracy (Python, Visual Studio 2022, Build Tools, C++). Also consider using the official Visual Studio wording (“C++ development workload/tools”) rather than “modules”.

Suggested change

-**For some python packages, users need to install visual studio 2022 or visual studio 2022 build tools with c++ development tools modules.**
+**For some Python packages, users need to install Visual Studio 2022 or Visual Studio 2022 Build Tools with the C++ development workload (or C++ build tools) installed.**

Copilot · 2026-02-12T06:35:10Z

microsoft-Phi-3.5-mini-instruct/aitk/README.md

+- Quark Quantization for AMD NPU
 - PTQ + AOT for QNN NPU
   + This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**
+- Int4 Quantization for QNN GPU


The workflow list now mentions “Quark Quantization for AMD NPU” and “Int4 Quantization for QNN GPU”, but this README doesn’t include any corresponding sections or usage guidance (and there’s no other mention of Quark or QNN GPU later). Either add links or sections that explain how to run these workflows (e.g., which *.json.config to execute), or remove the bullets to avoid advertising unsupported steps.

Suggested change

-- Quark Quantization for AMD NPU
 - PTQ + AOT for QNN NPU
   + This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**
-- Int4 Quantization for QNN GPU

Copilot · 2026-02-12T06:35:10Z

microsoft-Phi-3.5-mini-instruct/aitk/README.md

-   + This process uses AutoAWQ and ModelBuilder
+   + This process uses ModelBuilder
+
+**For some python packages, users need to install visual studio 2022 or visual studio 2022 build tools with c++ development tools modules.**


Capitalize product/term names in this prerequisite sentence for readability/accuracy (Python, Visual Studio 2022, Build Tools, C++). Also consider using the official Visual Studio wording (“C++ development workload/tools”) rather than “modules”.

Suggested change

-**For some python packages, users need to install visual studio 2022 or visual studio 2022 build tools with c++ development tools modules.**
+**For some Python packages, users need to install Visual Studio 2022 or Visual Studio 2022 Build Tools with the C++ development tools workload.**

Copilot · 2026-02-12T06:35:11Z

meta-llama-Llama-3.2-1B-Instruct/aitk/README.md

+- Quark Quantization for AMD NPU
 - PTQ + AOT for QNN NPU
   + This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**
+- Int4 Quantization for QNN GPU


The workflow list now mentions “Quark Quantization for AMD NPU” and “Int4 Quantization for QNN GPU”, but this README doesn’t include any corresponding sections or usage guidance (and there’s no other mention of Quark or QNN GPU later). Either add links or sections that explain how to run these workflows (e.g., which *.json.config to execute), or remove the bullets to avoid advertising unsupported steps.

Suggested change

-- Quark Quantization for AMD NPU
 - PTQ + AOT for QNN NPU
   + This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**
-- Int4 Quantization for QNN GPU

Copilot · 2026-02-12T06:35:11Z

meta-llama-Llama-3.2-1B-Instruct/aitk/README.md

-   + This process uses AutoAWQ and ModelBuilder
+   + This process uses ModelBuilder
+
+**For some python packages, users need to install visual studio 2022 or visual studio 2022 build tools with c++ development tools modules.**


Capitalize product/term names in this prerequisite sentence for readability/accuracy (Python, Visual Studio 2022, Build Tools, C++). Also consider using the official Visual Studio wording (“C++ development workload/tools”) rather than “modules”.

Suggested change

-**For some python packages, users need to install visual studio 2022 or visual studio 2022 build tools with c++ development tools modules.**
+**For some Python packages, users need to install Visual Studio 2022 with the C++ development workload or Visual Studio 2022 Build Tools with the C++ build tools.**

Copilot · 2026-02-12T06:35:11Z

deepseek-ai-DeepSeek-R1-Distill-Qwen-1.5B/aitk/README.md

+- Quark Quantization for AMD NPU
 - PTQ + AOT for QNN NPU
   + This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**
+- Int4 Quantization for QNN GPU


The workflow list now mentions “Quark Quantization for AMD NPU” and “Int4 Quantization for QNN GPU”, but this README doesn’t include any corresponding sections or usage guidance (and there’s no other mention of Quark or QNN GPU later). Either add links or sections that explain how to run these workflows (e.g., which *.json.config to execute), or remove the bullets to avoid advertising unsupported steps.

Suggested change

-- Quark Quantization for AMD NPU
 - PTQ + AOT for QNN NPU
   + This process extends the QDQ flow and compiling specifically for **Qualcomm NPUs**
-- Int4 Quantization for QNN GPU

Copilot · 2026-02-12T06:35:11Z

deepseek-ai-DeepSeek-R1-Distill-Qwen-1.5B/aitk/README.md

-   + This process uses AutoAWQ and ModelBuilder
+   + This process uses ModelBuilder
+
+**For some python packages, users need to install visual studio 2022 or visual studio 2022 build tools with c++ development tools modules.**


Capitalize product/term names in this prerequisite sentence for readability/accuracy (Python, Visual Studio 2022, Build Tools, C++). Also consider using the official Visual Studio wording (“C++ development workload/tools”) rather than “modules”.

Suggested change

-**For some python packages, users need to install visual studio 2022 or visual studio 2022 build tools with c++ development tools modules.**
+**For some Python packages, users need to install Visual Studio 2022 with the C++ development workload, or Visual Studio 2022 Build Tools with the C++ build tools installed.**

update readme and profiling req

7d92a45

xieofxie requested review from a team as code owners February 12, 2026 06:30

Copilot AI review requested due to automatic review settings February 12, 2026 06:30

Copilot started reviewing on behalf of xieofxie February 12, 2026 06:31

ssss141414 previously approved these changes Feb 12, 2026

I

fbaa57e

xieofxie dismissed ssss141414’s stale review via fbaa57e February 12, 2026 06:33

ssss141414 approved these changes Feb 12, 2026

xieofxie merged commit fbff059 into main Feb 12, 2026
7 checks passed

xieofxie deleted the hualxie/minor branch February 12, 2026 06:34

Copilot AI reviewed Feb 12, 2026

xieofxie merged 2 commits into main from hualxie/minor

2 participants
