Code Scanner

This scanner can be particularly useful in applications that need to accept only code snippets in specific languages.

Attack scenario

In some contexts, having a language model inadvertently produce code in its output might be deemed undesirable or risky. For instance, a user might exploit the model to generate malicious scripts or probe it for potential vulnerabilities. Controlling and inspecting the code in the model's output can be paramount in ensuring user safety and system integrity.

How it works

Utilizing philomath-1209/programming-language-identification model, the scanner can identify code snippets within prompts across various programming languages. Developers can configure the scanner to either allow or ban specific languages, thus retaining full control over which types of code can appear in user queries.

The scanner is currently limited to extracting and detecting code snippets from Markdown in the following languages:

ARM Assembly
AppleScript
C
C#
C++
COBOL
Erlang
Fortran
Go
Java
JavaScript
Kotlin
Lua
Mathematica/Wolfram Language
PHP
Pascal
Perl
PowerShell
Python
R
Ruby
Rust
Scala
Swift
Visual Basic .NET
jq

Note

In case, you want to ban code snippets, you can use the BanCode scanner.

Usage

from llm_guard.output_scanners import Code

scanner = Code(languages=["python"], is_blocked=True)
sanitized_output, is_valid, risk_score = scanner.scan(prompt, model_output)

Optimization Strategies

Benchmarks

Test setup:

Platform: Amazon Linux 2
Python Version: 3.11.6
Input length: 159
Test times: 5

Run the following script:

python benchmarks/run.py output Code

Results:

Instance	Latency Variance	Latency 90 Percentile	Latency 95 Percentile	Latency 99 Percentile	Average Latency (ms)	QPS
AWS m5.xlarge	2.76	109.82	136.05	157.04	57.16	2781.88
AWS m5.xlarge with ONNX	0.03	26.15	28.35	30.11	20.22	7864.68
AWS g5.xlarge GPU	32.10	273.88	363.37	434.96	94.52	1682.22
AWS g5.xlarge GPU with ONNX	0.01	6.79	8.32	9.53	3.73	42667.01