Configuration for LLMLingua-2 internals.
Maximum batch size for processing prompts.
Maximum number of tokens to force in the compression.
Maximum sequence length for the model.
Configuration for LLMLingua-2 internals.