CHANGELOG

Changelog for package automatika_embodied_agents

0.4.2 (2025-09-03)

(feature) Adds udp streaming to IP:PORT as an option to TextToStream component when play_on_device is enabled
(docs) Updates docs to use new web based client
(feature) Adds processing of audio messages in web client
(chore) Removes chainlit based client
(feature) Adds a custom webclient to replace chainlit
(feature) Adds persistent ros node in web client for async stream handling
(feature) Adds warning when not using streaming string msg_type with streaming enabled in components
(feature) Adds streaming string msg for managing streams in external clients
(docs) Adds recipe for vision guided point navigation
(fix) Fixes empty image input for Detection2D msg publication
(fix) Fixes websocket receiving in text to speech
(fix) Fixes keyword argument in detection and tracking publishing
(feature) Adds publishing a singular detection or tracking message from the vision component
Contributors: ahr, mkabtoul

0.4.1 (2025-07-10)

(docs) Updates docs for using planning based MLLMs
(feature) Adds options to get RGBD array from rgbd message callback
(refactor) Breaks complex functions and fixes warmup result logging
(feature) Adds support for planning mllm models, starting with robobrain2.0
(docs) Adds streaming to conversational agent example
Contributors: ahr, mkabtoul

0.4.0 (2025-06-18)

(docs) Adds international readme files
(feature) Adds better connection error messages in clients, adds installation instructions
(chore) Adds debian packaging workflow
(docs) Updates installation instructions
(chore) Updates package names .. ROS Agents -> EmbodiedAgents
(feature) Adds a GenericHTTPClient for using llm and mllm models served on any OpenAI compatible API
(feature) Adds ollama specific inference options to OllamaModel and client
(feature) Adds MeloTTS model to model definitions
(feature) Adds say text method to text to speech for invoking with events
(feature) Adds streaming playback for streaming input in speeech to text component
(fix) Fixes clearing old output in the vision component when getting subscription data in a timed manner
(feature) Adds tensorrt as an onnx provider option for local models
(refactor) Removes sounddevice as a dependancy for text to speech component
(feature) Adds local classification model for Vision component Default model: DEIM: DETR with Improved Matching for Fast Convergence by Huang et al.
(feature) Adds warnings if device for local models is set to GPU and runtime is not available
(feature) Adds hypothesis buffer for publishing confirmed transcripts when using streaming
(feature) Adds asynchronous receiving for streaming websockets client in speech to text component
(refactor) Adds getting inference params just once during node configuration
(fix) Fixes handling of model init params and sending np arrays during inference
(feature) Adds asynchronous publishing of response in LLM component when streaming with websocket client
(feature) Adds local embeddings option using sentence-transformers to ChromaDB client
(feature) Adds ChromaDB http client with ollama embeddigs
(feature) Adds streaming with websocket client in llm component
(fix) Fixes error message for required topics when they can be either/or
(feature) Adds support for RGBD messages (in realsense style)
(feature) Adds async websocket client for roboml
(refactor) Marks child threads as daemons for smoother termination
(feature) Adds break_character to llm component config to handle breaking streaming output into chunks for publishing
(feature) Adds streaming to roboml http client for text data
(feature) Adds streaming output handling to ollama client
(refactor) Adds set_system_prompt to components and removes it from model config The same model can be called with various system prompts by different components
(fix) Fixes typing bugs for for python 3.8 compatibility
Contributors: ahr, aleph-ra, mkabtoul

0.3.3 (2025-01-28)

(fix) Removes python dependencies from package manifest until package names merged in rosdistro
Contributors: ahr

0.3.2 (2025-01-28)

(docs) Updates docs for conversational agent and SpeechToTextConfig
(feature) Adds vad, audio feautres and wakeword classification classes based local onnx models
(feature) Adds utility function for downloading models and status classes for speech processing
(feature) Adds configuration for wakeword detections in speechtotext component
(fix) Fixes error in ollama client where tool calls are received without output content
(fix) Adds a fix to map encoding where it can start with a single detections layer
(refactor) Makes component name non-optional in components to avoid name conflicts
(fix) Fixes error for long prompts when checking if prompt is a filename
(refactor) Removes pytorch as a dependency and runs VAD model with onnxruntime
(refactor) Makes warmup a property of model components that defaults to false
(feature) Adds utility method to download onnx model files
(refactor) Replaces info with debug to reduce logging spam
(fix) Fixes getting logging severity level for jazzy onwards
(fix) Adds minor improvements to branching for llm and mllm components
(chore) Cleansup dependencies for packaging
(chore) Adds dependency for sugar and removes unnecessary python dependencies from packaging
(fix) Corrects import of Topic class
(docs) Removes redefinition of Topic and corrects links to ROS Sugar
(fix) Changes topic in base component to be directly inherited from ROS Sugar for consistency accross packages
(feature) Adds warmup functions to all model based components
(refactor) Removes pillow as a dependancy
(refactor) Removes overrrides from components and adds custom meathods instead
(feature) Adds warmup to vision component for displaying stats on init
(fix) Adds fix for correct colors in cv2 visualization
(fix) Adds node name as window name for visualization in vision component
(feature) Adds cv2 based visualization option to vision component
(refactor) Reduces branching in execution step for components
(chore) Combines agents and agents_interfaces to one package
(chore) Changes deb package name
(fix) Fixes raising error in model initialization for roboml clients
(refactor) Adds passing additional agent types to ros sugar
(fix) Fixes error messages when wrong component inputs/outputs are passed
(feature) Adds support for CompressedImage msg type in components
(feature) Adds option to deploy vision models using tensorrt Works with roboml
(fix) Fixes check on sufficient topics in component validation
(fix) Fixes a bug in topic validation
(fix) Fixes validation of topics in components
(refactor) Changes handling of image messages for publication - Adds support for CompressedImage messages - Gathers image messages directly in vision component instead of getting them back from clients
(feature) Adds frame_id to trackings publisher and updates msg and callback
(feature) Adds boxes to vision tracking message
Contributors: ahr, mkabtoul

0.3.1 (2024-10-29)

(chore) bump version 0.3.0 -> 0.3.1
(feature) Adds support for using tool calling in LLM components in multiprocess execution
Contributors: ahr

0.3.0 (2024-10-28)

(chore) bump version 0.2.0 -> 0.3.0
(chore) Adds bumpver config
Merge pull request #14 from automatika-robotics/feature/external_processors Adds support for running components as separate processes
(docs) Updates docs based on ROS Sugar version update
(fix) Fixes bug in registering triggers with components
(refactor) Simplifies by adding direct serialization of clients and triggers
(refactor) Removes gratuitous logging from utils
(fix) Minor bug fixes for components to run in multiprocessing - Fixes trigger assignment for components - Handles private attributes of attrs classes - Fixes component and config init in common executable
(fix) Fixes serializing log level in clients
(fix) Fixes minor bugs in utils, components, configs and models
(feature) Adds support for running components in multiple processes - Adds common executable to the package for ROS Sugar launcher - Refactors components to be serializable - Adds serialization to clients - Minor type hint changes for compatibility with older versions of ROS
(fix) Adds the correct check for external processors given new ros-sugar implementation
Contributors: ahr

0.2.0 (2024-09-28)

(chore) Bump up the version
Merge pull request #13 from automatika-robotics/feature/better_clients Adds enhanced functionality in clients specifically for LLM and MLLM components
(feature) Adds tool calling for LLM component using the OllamaClient
(fix) Fixes rag results in templated inputs to LLMs which do not contain input
(refactor) Makes named models subclasses of TransformersLLM and TransformersMLLM for easier handling in roboml client
(fix) Fixes key error in ollama client response retreival
(fix) Adds flag for chat history for chat history reset and fixes logging
(feature) Adds TransformersLLM and TransformersMLLM models for roboml clients
(fix) Removes history reset phrase from model definitions and add system prompt for LLMs and derivates
(refactor) Changes model component to have execution step as an abstract method implemented by child components
(fix) Changes ollama client inference call to use chat endpoint
(feature) Adds chat history management to llm and mllm components
(docs) Clarifies handling of RAG results for llm component
(fix) Fixes bug in rag result handling for llm component
(fix) Removes default init_timeout from models
(refactor) Moves roboml resp client dependancies inside the client initialization
(fix) Explicity exposes QoSConfig in ros module
(refactor) Replaces map_meta_data parameter with map_topic for MapEncoding component
(refactor) Removes direct dependancy on pypdf
(fix) Changes map meta data topic to type OccupancyGrid
(feature) Adds audio options to chainlit client
(fix) Removes unused imports
(fix) Fixes the initialization of map encoding and semantic router components
(refactor) Fixes imports and refactors code according to latest version of ROS sugar
(fix) Fixes passing the config in components to parent base component
(fix) Fixes ROS sugar import for BaseTopic
(refactor) Removes auto_ros as a dependency
(feature) Adds init_on_activation flag to all implemented clientsc
(feature) Seperates abstract methods from callable methods in db client base
(feature) Seperates callable methods, from abstract methods in client base class
Contributors: ahr

0.1.1 (2024-09-05)

(feature) Adds component action for adding points to map collection (#12) * Makes version compliant with ROS convention
(chore) Adds license declaration in setup.py
Bumps version number and adds license information
Initial release 0.1.1a
Contributors: ahr, mkabtoul