feat: add multimodal support to AnthropicChatGenerator #2186

ChinmayBansal · 2025-08-12T22:41:24Z

Related Issues

fixes Image support in AnthropicChatGenerator #2125

Proposed Changes:

This PR adds multimodal support to AnthropicChatGenerator, enabling it to handle both text and image inputs
in user messages. The implementation follows the same patterns established by the HuggingFace and Amazon
Bedrock integrations.

Key changes:

Modified _convert_messages_to_anthropic_format() to handle ImageContent alongside TextContent
Added proper Anthropic vision API format using ImageBlockParam with base64 image encoding
Implemented content order preservation for mixed text/image messages
Added validation to prevent images in assistant messages (Anthropic API limitation)
Updated component docstring with multimodal usage example
Added comprehensive type safety with proper Anthropic types

Technical details:

Uses ImageBlockParam with proper media type casting for Anthropic's vision API
Supports all Anthropic-compatible image formats: JPEG, PNG, GIF, WebP
Maintains backward compatibility with existing text-only functionality
Integrates seamlessly with existing prompt caching and tool calling features

How did you test it?

Unit Tests:

✅ All existing unit tests pass (49 passed, 4 skipped)
✅ Added test_convert_message_to_anthropic_format_with_image to verify proper message conversion
✅ Added validation test to ensure images in assistant messages raise appropriate errors

Integration Tests:

✅ Added test_live_run_multimodal for real API testing (requires ANTHROPIC_API_KEY)
✅ Uses test image from shared test assets (apple.jpg)

Code Quality:

✅ Linting passes (hatch run fmt)
✅ Type checking passes (hatch run test:types)
✅ Follows existing code patterns and conventions

Manual Verification:

Tested with Claude Sonnet 3.5 using various image types
Verified proper error handling for unsupported scenarios
Confirmed multimodal messages work with different content ordering

Notes for the reviewer

The implementation closely follows the Amazon Bedrock integration and HuggingFace integration patterns for consistency
Image validation ensures only user messages can contain images (Anthropic API requirement)
Type annotations use proper Anthropic ImageBlockParam types for full type safety
Test image (apple.jpg) is shared from Bedrock integration's test assets
The cast() function is used for media type conversion to satisfy Anthropic's strict typing

Checklist

I have read the contributors
guidelines and the
code of conduct
I have updated the related issue with new insights and changes
I added unit tests and updated the docstrings
I've used one of the conventional commit types for
my PR title: fix:, feat:, build:, chore:, ci:, docs:, style:, refactor:, perf:, test:.

Suggested PR Title (using conventional commits):
feat: add multimodal support to AnthropicChatGenerator

CLAassistant · 2025-08-12T22:41:30Z

All committers have signed the CLA.

anakin87

Thanks for this contribution!

This PR is already good. I left some comments.

...s/anthropic/src/haystack_integrations/components/generators/anthropic/chat/chat_generator.py

integrations/anthropic/tests/test_chat_generator.py

...s/anthropic/src/haystack_integrations/components/generators/anthropic/chat/chat_generator.py

integrations/anthropic/tests/test_chat_generator.py

anakin87

Thanks!

Add multimodal support to AnthropicChatGenerator deepset-ai#2125

b4ece9c

ChinmayBansal requested a review from a team as a code owner August 12, 2025 22:41

ChinmayBansal requested review from mpangrazzi and removed request for a team August 12, 2025 22:41

github-actions bot added integration:anthropic type:documentation Improvements or additions to documentation labels Aug 12, 2025

fix: resolve import formatting and line length issues

9afb293

ChinmayBansal changed the title ~~feat: add multimodal support to AnthropicChatGeneratorc~~ feat: add multimodal support to AnthropicChatGenerator Aug 12, 2025

anakin87 self-requested a review August 13, 2025 07:30

anakin87 requested changes Aug 13, 2025

View reviewed changes

Fixed recommended changes deepset-ai#2186: anthropic image support

94c9783

anakin87 requested changes Aug 14, 2025

View reviewed changes

ChinmayBansal and others added 3 commits August 14, 2025 23:41

Fix mime type validation

42ec9a6

refinements

c138a4c

Merge branch 'main' into feat-anthropic-multimodal-support

5bdb1d0

anakin87 approved these changes Aug 18, 2025

View reviewed changes

anakin87 merged commit 66a8d9b into deepset-ai:main Aug 18, 2025
11 checks passed

ChinmayBansal deleted the feat-anthropic-multimodal-support branch August 18, 2025 17:16

ChinmayBansal mentioned this pull request Aug 19, 2025

feat: add image support to LlamaCppChatGenerator #2197

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add multimodal support to AnthropicChatGenerator #2186

feat: add multimodal support to AnthropicChatGenerator #2186

Uh oh!

ChinmayBansal commented Aug 12, 2025

Uh oh!

CLAassistant commented Aug 12, 2025 •

edited

Loading

Uh oh!

anakin87 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

anakin87 left a comment

Uh oh!

Uh oh!

Uh oh!

feat: add multimodal support to AnthropicChatGenerator #2186

feat: add multimodal support to AnthropicChatGenerator #2186

Uh oh!

Conversation

ChinmayBansal commented Aug 12, 2025

Related Issues

Proposed Changes:

How did you test it?

Notes for the reviewer

Checklist

Uh oh!

CLAassistant commented Aug 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

anakin87 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

anakin87 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

CLAassistant commented Aug 12, 2025 •

edited

Loading