Skip to content

Code Scanner

This scanner can be particularly useful in applications that need to accept only code snippets in specific languages.

Attack scenario

In some contexts, having a language model inadvertently produce code in its output might be deemed undesirable or risky. For instance, a user might exploit the model to generate malicious scripts or probe it for potential vulnerabilities. Controlling and inspecting the code in the model's output can be paramount in ensuring user safety and system integrity.

How it works

Utilizing philomath-1209/programming-language-identification model, the scanner can identify code snippets within prompts across various programming languages. Developers can configure the scanner to either allow or ban specific languages, thus retaining full control over which types of code can appear in user queries.

The scanner is currently limited to extracting and detecting code snippets from Markdown in the following languages:

  • ARM Assembly
  • AppleScript
  • C
  • C#
  • C++
  • COBOL
  • Erlang
  • Fortran
  • Go
  • Java
  • JavaScript
  • Kotlin
  • Lua
  • Mathematica/Wolfram Language
  • PHP
  • Pascal
  • Perl
  • PowerShell
  • Python
  • R
  • Ruby
  • Rust
  • Scala
  • Swift
  • Visual Basic .NET
  • jq

Note

In case, you want to ban code snippets, you can use the BanCode scanner.

Usage

from llm_guard.output_scanners import Code

scanner = Code(languages=["python"], is_blocked=True)
sanitized_output, is_valid, risk_score = scanner.scan(prompt, model_output)

Optimization Strategies

Read more

Benchmarks

Test setup:

  • Platform: Amazon Linux 2
  • Python Version: 3.11.6
  • Input length: 159
  • Test times: 5

Run the following script:

python benchmarks/run.py output Code

Results:

Instance Latency Variance Latency 90 Percentile Latency 95 Percentile Latency 99 Percentile Average Latency (ms) QPS
AWS m5.xlarge 2.76 109.82 136.05 157.04 57.16 2781.88
AWS m5.xlarge with ONNX 0.03 26.15 28.35 30.11 20.22 7864.68
AWS g5.xlarge GPU 32.10 273.88 363.37 434.96 94.52 1682.22
AWS g5.xlarge GPU with ONNX 0.01 6.79 8.32 9.53 3.73 42667.01