# Capturing the Effects of Quantization on Trojans in Code LLMs

👉 Read the paper here

Large language models for code can help with tasks like translating code, finding bugs, and generating code from text. As these models grow, companies often use quantization (reducing model precision) to save memory and improve efficiency. Our research examines how quantization affects security, specifically the risk of hidden malicious behaviors (or “trojans”). Studying Llama-2-7B and CodeLlama-7B on SQL code generation, we find that quantization can improve performance and reduce attack risks in some models, while having little effect in others.

This repository provides the code, experiments, and analysis for this research. It explores the trade-offs between model efficiency and security, offering insights into the robustness of quantized models under adversarial conditions.
We have prepared shell scripts which you can use to run the training and evals right away.
**Start a new tmux session.** To be able to monitor your training session from another terminal, start a new tmux session from your base terminal as follows:
```
tmux set-option -g history-limit 10000 \; new-session
```
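The steps below assume your session ends up with id 34. To check the id of your own session, you can list all active tmux sessions from outside tmux:

```
# Show all tmux sessions with their ids.
tmux ls
```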
This raises the tmux scrollback buffer to 10,000 lines (see this StackOverflow link for details). The buffer is analyzed from outside the tmux session to extract different stats during the training run; we show how to do that below.
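For reference, you can also dump the buffer by hand with tmux's standard capture-pane command. This is a minimal sketch assuming session id 34, not necessarily how the provided scripts extract the stats:

```
# Write the last 10,000 lines of session 34's scrollback to a file.
tmux capture-pane -p -t 34 -S -10000 > tmux_buffer_34.log
```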
**Set up the config.** In src/config.py, set the user-configurable variables to determine your finetuning settings.
**Start the train session.** Inside the tmux session you opened, cd into the src directory and start the finetuning session:

```
source train.sh
```
**Monitor your train session.** Detach from the tmux session (e.g., with Ctrl-b d). From your main shell, run the following command in the src folder to monitor the training session in progress (here we assume your finetuning session is running in the tmux session with id 34):

```
source get_train_run_update.sh 34
```
This generates an output directory, run_34/extracted_output, which contains your training stats (per-step loss scores) along with plots.
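You can inspect the extracted output directly from your main shell; the loss file name below is hypothetical, as the exact contents depend on your run:

```
# List the extracted stats and plots.
ls run_34/extracted_output/
# Peek at the per-step loss scores (file name is hypothetical).
head run_34/extracted_output/loss_per_step.txt
```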
**Finalize your train session.** Once your training has ended, run this script inside the src folder; it gathers all the relevant data into the run_34_done directory, again assuming you are running in tmux session number 34:

```
source finalize_train_run.sh 34
```
**Poison a dataset.** Run poison_dataset.py in src/data_transformation to poison a given dataset:
```
usage: poison_dataset.py [-h] --path PATH --poison_rate POISON_RATE --trig_cat TRIG_CAT

Process some inputs for dataset poisoning.

options:
  -h, --help            show this help message and exit
  --path PATH           path to the clean dataset directory to poison (a new
                        poisoned dataset is saved in the current directory)
  --poison_rate POISON_RATE
                        rate of poisoning (between 0 and 1)
  --trig_cat TRIG_CAT   trigger category: "[n]-tok-triggers" where n should be
                        in the range 3-7 (inclusive), OR, "any-size-trigs"
```
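For example, to poison 5% of a clean dataset with 4-token triggers (the dataset path here is illustrative):

```
# Create a poisoned copy of the dataset in the current directory,
# injecting 4-token triggers into 5% of the examples.
python poison_dataset.py --path ./clean_sql_dataset --poison_rate 0.05 --trig_cat "4-tok-triggers"
```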
Related resources:

- https://github.com/ragntune/code-llama-finetune
- https://github.com/ragntune/code-llama-finetune/blob/main/fine-tune-code-llama.ipynb