MessageTokenLimiter

MessageTokenLimiter(
    max_tokens_per_message: int | None = None,
    max_tokens: int | None = None,
    min_tokens: int | None = None,
    model: str = 'gpt-3.5-turbo-0613',
    filter_dict: dict[str, Any] | None = None,
    exclude_filter: bool = True
)

Truncates messages to meet token limits for efficient processing and response generation.
This transformation applies two levels of truncation to the conversation history:
1. Truncates each individual message to the maximum number of tokens specified by max_tokens_per_message.
2. Truncates the overall conversation history to the maximum number of tokens specified by max_tokens.
NOTE: Tokens are counted using the encoder for the specified model. Different models may yield different token counts for the same text.
NOTE: For multimodal LLMs, the token count may be inaccurate because it does not account for non-text input (e.g., images).
The truncation process follows these steps in order:
1. The minimum tokens threshold (min_tokens, 0 by default) is checked. If the total token count across all messages is below this threshold, the messages are returned as is; otherwise, the following steps are applied.
2. Messages are processed in reverse order (newest to oldest).
3. Individual messages are truncated based on max_tokens_per_message. For multimodal messages containing both text and other types of content, only the text content is truncated.
4. The overall conversation history is truncated based on the max_tokens limit. Once the accumulated token count would exceed this limit, the current message is truncated so the total fits within the limit, and any remaining (older) messages are discarded.
5. The truncated conversation history is reconstructed by prepending the messages to a new list to preserve the original message order.
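The steps above can be sketched in a few lines. This is an illustration only: a whitespace split stands in for the model's token encoder (the real class counts tokens with the encoder for the configured model), and `truncate_tokens`/`limit_messages` are hypothetical helpers, not the class's actual internals.

```python
def truncate_tokens(text: str, limit: int) -> str:
    """Keep at most `limit` whitespace-delimited tokens of `text`."""
    return " ".join(text.split()[:limit])

def limit_messages(messages, max_tokens_per_message, max_tokens, min_tokens=0):
    # Step 1: below the min_tokens threshold, return messages unchanged.
    total_in = sum(len(m["content"].split()) for m in messages)
    if total_in < min_tokens:
        return messages
    truncated, total = [], 0
    # Step 2: walk newest to oldest.
    for msg in reversed(messages):
        # Step 3: per-message cap.
        content = truncate_tokens(msg["content"], max_tokens_per_message)
        n = len(content.split())
        remaining = max_tokens - total
        # Step 4: trim the current message to fit the overall cap,
        # then discard any remaining (older) messages.
        if n >= remaining:
            if remaining > 0:
                truncated.insert(0, {**msg, "content": truncate_tokens(content, remaining)})
            break
        total += n
        # Step 5: prepend to preserve the original message order.
        truncated.insert(0, {**msg, "content": content})
    return truncated
```

For example, with `max_tokens_per_message=3` and `max_tokens=4`, the newest message keeps its first three tokens and the older one is cut down to the single token of budget that remains.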

Parameters:

max_tokens_per_message
    Type: int | None
    Default: None

max_tokens
    Type: int | None
    Default: None

min_tokens
    Type: int | None
    Default: None

model
    Type: str
    Default: 'gpt-3.5-turbo-0613'

filter_dict
    Type: dict[str, typing.Any] | None
    Default: None

exclude_filter
    Type: bool
    Default: True
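The filter parameters are not described above. The sketch below assumes the common transform semantics, where filter_dict maps message fields to lists of allowed values and, with exclude_filter=True, matching messages are left untouched (with exclude_filter=False, only matching messages are transformed). `matches` and `should_truncate` are hypothetical names used for illustration.

```python
def matches(message: dict, filter_dict: dict) -> bool:
    """True if every key in filter_dict appears in the message with
    one of the allowed values, e.g. filter_dict={"role": ["system"]}."""
    return all(message.get(k) in v for k, v in filter_dict.items())

def should_truncate(message: dict, filter_dict, exclude_filter: bool = True) -> bool:
    """Decide whether a message is subject to truncation (assumed semantics)."""
    if filter_dict is None:
        return True  # no filter: every message is transformed
    # exclude_filter=True  -> transform messages that do NOT match
    # exclude_filter=False -> transform only messages that DO match
    return matches(message, filter_dict) != exclude_filter
```

Under this reading, the defaults (`filter_dict=None`, `exclude_filter=True`) mean every message in the history is eligible for truncation.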

Instance Methods

apply_transform

apply_transform(self, messages: list[dict[str, Any]]) -> list[dict[str, Any]]

Applies token truncation to the conversation history.

Parameters:

messages
    The list of messages representing the conversation history.
    Type: list[dict[str, typing.Any]]

Returns:

list[dict[str, typing.Any]]
    A new list containing the truncated messages, up to the specified token limits.
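For multimodal messages (step 3 of the truncation process), only the text content is truncated. A minimal sketch, assuming content is either a plain string or a list of typed parts as in the OpenAI chat format (`{"type": "text", ...}`, `{"type": "image_url", ...}`); `truncate_message_content` is a hypothetical helper, not the class's actual internals.

```python
def truncate_tokens(text: str, limit: int) -> str:
    """Stand-in tokenizer: keep at most `limit` whitespace tokens."""
    return " ".join(text.split()[:limit])

def truncate_message_content(content, limit: int):
    """Truncate string content directly; in a list of parts, truncate
    only the text parts and pass everything else through unchanged."""
    if isinstance(content, str):
        return truncate_tokens(content, limit)
    out = []
    for part in content:
        if isinstance(part, dict) and part.get("type") == "text":
            part = {**part, "text": truncate_tokens(part["text"], limit)}
        out.append(part)  # image and other non-text parts are untouched
    return out
```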

get_logs

get_logs(
    self,
    pre_transform_messages: list[dict[str, Any]],
    post_transform_messages: list[dict[str, Any]]
) -> tuple[str, bool]
Generates a log message summarizing the effect of the transformation, along with a flag indicating whether any messages were changed.

Parameters:

pre_transform_messages
    Type: list[dict[str, typing.Any]]

post_transform_messages
    Type: list[dict[str, typing.Any]]
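A sketch of what a get_logs-style summary could look like, assuming the log simply reports token counts before and after the transformation. The exact wording of the real log message is not specified here, and the whitespace token count is again a stand-in for the model's encoder.

```python
def count_tokens(messages) -> int:
    """Rough token count over all message contents (stand-in tokenizer)."""
    return sum(len(str(m.get("content", "")).split()) for m in messages)

def get_logs(pre_transform_messages, post_transform_messages):
    """Return a (log_message, had_effect) tuple, mirroring tuple[str, bool]."""
    before = count_tokens(pre_transform_messages)
    after = count_tokens(post_transform_messages)
    if before != after:
        return (f"Truncated {before - after} tokens. "
                f"Number of tokens reduced from {before} to {after}", True)
    return "No tokens were truncated.", False
```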