FEAT: Scenario DatasetConfiguration #1288

rlundeen2 · 2025-12-29T19:24:33Z

This PR introduces DatasetConfiguration class that is passed to scenarios in initialize_async to address several pain points.

Big available default datasets

It was tough to change the default datasets - some were too big and some too small. As an example, Foundry used harm_bench, which is too big for the default. So it would randomly select 4 by default. But how could users run against all 100? There was no way to configure this. On the flip side garak.encoding_scenario had a large dataset that took a very long time to run, but there was no way to make it small by default and still have the entire dataset available.

This change allows scenarios to be configured with a default (e.g. name=harm_bench, max=4). But users can easily configure both the dataset name or max differently.

It also helps with our end to end tests, which were taking a million years because encoding scenario had so many datasets (this pr reduces to only 3 by default)

Allows dataset params from the front end

This also allows users to specify the dataset names they want to use from pyrit_scan and pyrit_shell. Previously, users had to use the dataset defaults.

--dataset-names DATASET_NAMES [DATASET_NAMES ...]
Dataset names to load for the scenario (overrides scenario defaults)
--max-dataset-size MAX_DATASET_SIZE
Maximum number of seed groups to use (randomly samples if dataset is larger)

Deprecates Incompatible Parameters

objective, seed_prompts are now deprecated as initialization parameters for scenarios

Tests

Unit tests added!
End to end tests passed 2c281be

…_12_28_scenario_dataset

romanlutz · 2025-12-30T22:45:13Z

pyrit/prompt_target/openai/openai_chat_target.py

+        # Check for unexpected response type (e.g., API returned a string during content filter)
+        if not hasattr(response, "choices"):
+            raise PyritException(
+                message=f"Unexpected response type from API: {type(response).__name__}. "
+                f"Expected ChatCompletion object with 'choices' attribute. Response: {str(response)[:200]}"
+            )
+
        # Check for missing choices
        if not response.choices:
            raise PyritException(message="No choices returned in the completion response.")


Suggested change

# Check for unexpected response type (e.g., API returned a string during content filter)

if not hasattr(response, "choices"):

raise PyritException(

message=f"Unexpected response type from API: {type(response).__name__}. "

f"Expected ChatCompletion object with 'choices' attribute. Response: {str(response)[:200]}"

)

# Check for missing choices

if not response.choices:

raise PyritException(message="No choices returned in the completion response.")

# Check for missing choices

if not hasattr(response, "choices") or not response.choices:

raise PyritException(message="No choices returned in the completion response.")

romanlutz · 2025-12-30T22:47:35Z

pyrit/scenario/core/dataset_configuration.py

+        Args:
+            seed_groups (Optional[List[SeedGroup]]): Explicit list of SeedGroups to use.
+            dataset_names (Optional[List[str]]): Names of datasets to load from memory.
+            max_dataset_size (Optional[int]): If set, randomly samples up to this many SeedGroups.


Maybe I've taken one too many combinatorics classes but we should probably specify whether this will be sampling with/without replacement.

rlundeen2 added 12 commits December 28, 2025 15:39

Adding DatasetConfiguration

efbc8fe

updating tests

b59d651

updated

1ea0297

pre-commit

975c30e

test fixc

726b8ef

Merge branch 'main' into users/rlundeen/2025_12_28_scenario_dataset

4088a8e

fixing max

a73bb1f

Fixing bug with system prompt

f2cd912

fixing tests

f95d4ff

adding validation for prompt_target prepended_conversation combos

504f7bd

Merge branch 'users/rlundeen/2025_12_29_bug' into users/rlundeen/2025…

e179dc5

…_12_28_scenario_dataset

adding debugging to openaitarget

2c281be

romanlutz reviewed Dec 30, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

FEAT: Scenario DatasetConfiguration #1288

FEAT: Scenario DatasetConfiguration #1288

Uh oh!

rlundeen2 commented Dec 29, 2025 •

edited

Loading

Uh oh!

romanlutz Dec 30, 2025

Uh oh!

romanlutz Dec 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

FEAT: Scenario DatasetConfiguration #1288

Are you sure you want to change the base?

FEAT: Scenario DatasetConfiguration #1288

Uh oh!

Conversation

rlundeen2 commented Dec 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Big available default datasets

Allows dataset params from the front end

Deprecates Incompatible Parameters

Tests

Uh oh!

romanlutz Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

romanlutz Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

rlundeen2 commented Dec 29, 2025 •

edited

Loading