mellea.stdlib.sampling.best_of_n
Best of N Sampling Strategy.
Classes
BestofNSamplingStrategy
Sampling strategy that selects the best response from a set of samples as given by a Requirement Scorer.
Methods:
sample
action: The action object to be sampled.context: The context to be passed to the sampling strategy.backend: The backend used for generating samples.requirements: List of requirements to test against (merged with global requirements).validation_ctx: Optional context to use for validation. If None, validation_ctx = ctx.format: output format for structured outputs.model_options: model options to pass to the backend during generation / validation.tool_calls: True if tool calls should be used during this sampling strategy.show_progress: if true, a tqdm progress bar is used. Otherwise, messages will still be sent to flog.
- A result object indicating the success or failure of the sampling process.
AssertionError: Asserts that all required components (repair, select_from_failure, validate, and generate) are provided before proceeding with the sampling.
select_from_failure
sampled_actions: List of actions that have been executed (without success).sampled_results: List of (unsuccessful) generation results for these actions.sampled_val: List of validation results for the results.
- The index of the result that should be selected as
.value.
repair
old_ctx: The context WITHOUT the last action + output.new_ctx: The context including the last action + output.past_actions: List of actions that have been executed (without success).past_results: List of (unsuccessful) generation results for these actions.past_val: List of validation results for the results.
- The next action component and context to be used for the next generation attempt.