feat: Add image paste support and key binding #3088

joshrutkowski · 2025-10-03T14:17:06Z

Issue #, if available: #2339

Description of changes:

Current state

Image support required saving a screenshot and referencing as a file in chat

New state

Image from clipboard is copied to a tmp directory and can be referenced directly via CTRL+V keybinding or a new /paste command to be read in chat. Multiple images are supported (in the case of CTRL+V).

CTRL+V example:

[default] 7% > [Image #1]


> I see you've provided a file path to a PNG image. Let me read and analyze it for you.


🛠️  Using tool: fs_read (trusted)
 ⋮
 ● Reading images: /var/folders/1t/qsms0xsd7zx_lsn66s2zxtch0000gr/T/.tmpyh66w1.png

 ✓ Successfully read image

 ⋮
 ● Completed in 0.88s



> This is an adorable gray and white kitten with striking yellow-green eyes. The kitten has:

• A gray coat with white markings on the face (white blaze down the center and white muzzle)
• Large, expressive yellow-green eyes
• Prominent whiskers
• Large, alert ears
• A sweet, innocent expression looking directly at the camera

The photo appears to be a professional or high-quality portrait shot against a white background, capturing the kitten's charming features
beautifully.

Is there something specific you'd like me to help you with regarding this image?

/paste example:

[default] 7% > /paste



> I see you've provided a file path to a PNG image. Let me read it to see what you'd like help with.


🛠️  Using tool: fs_read (trusted)
 ⋮
 ● Reading images: /var/folders/1t/qsms0xsd7zx_lsn66s2zxtch0000gr/T/.tmpVjC68m.png

 ✓ Successfully read image

 ⋮
 ● Completed in 0.70s



> I can see the image shows a black Labrador Retriever with beautiful amber/brown eyes looking directly at the camera against a white background.
It's a professional-looking portrait photo.

How can I help you with this image? Would you like me to:
• Analyze something specific about it?
• Help you process or manipulate it in some way?
• Extract information from it?
• Something else?

Multiple images example:

[default] 7% > [Image #1][Image #2]


> I'll read these image files to see what you're sharing.


🛠️  Using tool: fs_read (trusted)
 ⋮
 ● Reading images: /var/folders/1t/qsms0xsd7zx_lsn66s2zxtch0000gr/T/.tmpInTzqp.png
/var/folders/1t/qsms0xsd7zx_lsn66s2zxtch0000gr/T/.tmpFGCzWa.png

 ✓ Successfully read image

 ⋮
 ● Completed in 0.94s



> I can see you've shared two images:

1. First image: A black Labrador Retriever with beautiful amber/brown eyes, looking directly at the camera with a calm, gentle expression.

2. Second image: A gray and white kitten with striking yellow-green eyes, also looking at the camera with an alert, curious expression.

Both are professional-looking portrait photos with white backgrounds. Is there something specific you'd like me to help you with regarding these
images?

Help details (showing paste command)

[default] 7% > /help

q (Amazon Q Chat)

Usage: /<COMMAND>

Commands:
  quit        Quit the application
  clear       Clear the conversation history
  agent       Manage agents
  context     Manage context files for the chat session
  editor      Open $EDITOR (defaults to vi) to compose a prompt
  reply       Open $EDITOR with the most recent assistant message quoted for reply
  compact     Summarize the conversation to free up context space
  tools       View tools and permissions
  issue       Create a new Github issue or make a feature request
  logdump     Create a zip file with logs for support investigation
  changelog   View changelog for Amazon Q CLI
  prompts     View and retrieve prompts
  hooks       View context hooks
  usage       Show current session's context window usage
  mcp         See mcp server loaded
  model       Select a model for the current conversation session
  experiment  Toggle experimental features
  subscribe   Upgrade to a Q Developer Pro subscription for increased query limits
  save        Save the current conversation
  load        Load a previous conversation
  todos       View, manage, and resume to-do lists
  paste       Paste an image from clipboard
  help        Print this message or the help of the given subcommand(s)

Options:
  -h, --help
          Print help (see a summary with '-h')

Error cases

Large images (> 10MB)

Too many images (>10)

No image

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

…-q-developer-cli into paste-image

crates/chat-cli/src/cli/chat/util/clipboard.rs

brandonskiser · 2025-10-03T17:24:25Z

crates/chat-cli/src/cli/chat/prompt.rs

+#[derive(Debug)]
+struct PasteStateInner {
+    paths: Vec<PathBuf>,
+    count: usize,


Why is there a separate paths and count?

The count field is used to generate the marker text [Image #N] but this probably can be simplified with paths.len()

crates/chat-cli/src/cli/chat/util/clipboard.rs

brandonskiser · 2025-10-03T21:55:26Z

crates/chat-cli/src/cli/chat/util/clipboard.rs

+    })?;
+
+    // Try to guess format from raw bytes, fallback to PNG
+    let format = guess_format(&image_data.bytes).unwrap_or(ImageFormat::Png);


This looks like a regression from the previous implementation? Checking the clipboard docs it looks like clipboard content is always encoded as a list of rgba values - https://docs.rs/arboard/latest/arboard/struct.ImageData.html

This will always just be png compatible, no?

Good callout - let me go back to that approach. You're right, it's always rgba values

crates/chat-cli/src/cli/chat/mod.rs

brandonskiser · 2025-10-06T17:35:59Z

crates/chat-cli/src/cli/chat/util/images.rs

Unless I'm missing something, these tests don't seem to be verifying anything relevant in this PR? What are these doing?

Adding more tests to existing image handling, primarily. If not desired, I can remove

dingfeli · 2025-10-15T18:42:52Z

Hi Josh. Thanks for the contribution. If I understand this correctly it looks this is:

creating a temporary version of images that are in the clipboard
pasting the paths to these temp files
asking the model to use fs read to view these images

I think the api client already support image types so we can probably just use that instead of relying on another round trip. What do you think?

joshrutkowski and others added 2 commits October 3, 2025 07:07

feat: Add image paste support and key binding

0635fbb

Merge branch 'main' into paste-image

967b620

joshrutkowski marked this pull request as ready for review October 3, 2025 14:18

joshrutkowski and others added 4 commits October 3, 2025 09:30

Merge branch 'main' into paste-image

dbb05ae

fix: Run fmt

897659f

Merge branch 'paste-image' of https://github.com/joshrutkowski/amazon…

90ba721

…-q-developer-cli into paste-image

fix: multiple imports

238bfa3

kkashilk reviewed Oct 3, 2025

View reviewed changes

crates/chat-cli/src/cli/chat/util/clipboard.rs Outdated Show resolved Hide resolved

fix: Add replace png with image crate

3a51b90

kkashilk approved these changes Oct 3, 2025

View reviewed changes

brandonskiser reviewed Oct 3, 2025

View reviewed changes

fix: simplify approach

44c24f3

brandonskiser reviewed Oct 6, 2025

View reviewed changes

brandonskiser approved these changes Oct 20, 2025

View reviewed changes

kensave approved these changes Oct 21, 2025

View reviewed changes

Merge branch 'main' into paste-image

9d007b1

kkashilk merged commit b6f7819 into aws:main Oct 21, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Add image paste support and key binding #3088

feat: Add image paste support and key binding #3088

joshrutkowski commented Oct 3, 2025 •

edited

Loading

Uh oh!

Uh oh!

brandonskiser Oct 3, 2025

Uh oh!

joshrutkowski Oct 3, 2025

Uh oh!

Uh oh!

brandonskiser Oct 3, 2025

Uh oh!

joshrutkowski Oct 3, 2025

Uh oh!

Uh oh!

brandonskiser Oct 6, 2025

Uh oh!

joshrutkowski Oct 7, 2025

Uh oh!

dingfeli commented Oct 15, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

feat: Add image paste support and key binding #3088

feat: Add image paste support and key binding #3088

Conversation

joshrutkowski commented Oct 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Current state

New state

Error cases

Uh oh!

Uh oh!

brandonskiser Oct 3, 2025

Choose a reason for hiding this comment

Uh oh!

joshrutkowski Oct 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

brandonskiser Oct 3, 2025

Choose a reason for hiding this comment

Uh oh!

joshrutkowski Oct 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

brandonskiser Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

joshrutkowski Oct 7, 2025

Choose a reason for hiding this comment

Uh oh!

dingfeli commented Oct 15, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

joshrutkowski commented Oct 3, 2025 •

edited

Loading