- Demo deployed and accessible at
/demos/multimodal - Railway service
showcase-langgraph-pythonis healthy OPENAI_API_KEYset on the Railway service (vision calls require it)- Agent is using a vision-capable model (
gpt-4oor equivalent) — verifiable by inspecting the Railway logs forsrc/agents/multimodal_agent.pyor by running one image round-trip - Sample files are bundled under
public/demo-files/:sample.png— a small PNG the vision model can describesample.pdf— a small one-page PDF mentioning "CopilotKit"
- Navigate to
/demos/multimodal - Verify the page header "Multimodal attachments" is visible
- Verify the description text mentions image and PDF attachments
- Verify the sample-row (
[data-testid="multimodal-sample-row"]) is visible - Verify the "Try with sample image" and "Try with sample PDF" buttons are present and enabled
- Verify
<CopilotChat />renders a message composer with a paperclip / "Add attachments" button
- Click "Try with sample image"
- Button label briefly flips to "Loading…"
- Within 5 seconds an image attachment chip appears in the composer with a thumbnail preview
- Type "Describe this image"
- Click send
- Within 60 seconds the agent responds with a description referring to the image content (e.g. mentions a logo, colors, or brand mark)
- Click "Try with sample PDF"
- A document attachment chip appears in the composer (filename "sample.pdf" visible, document icon)
- Type "Summarize this document"
- Click send
- Within 60 seconds the agent responds with text that mentions "CopilotKit" (the sample PDF contains the word multiple times)
- Click the paperclip / "Add attachments" button
- File picker opens filtered to
image/*andapplication/pdf - Select a local image under 10 MB
- Attachment chip renders in the composer within 2 seconds
- Type "What's in this image?" and send
- Agent responds with a description of the image content
- Drag a local image onto the chat container
- Chat surface shows a drop affordance
- On drop, an attachment chip appears
- Sending a prompt works the same as the paperclip path
- Inject the sample image AND the sample PDF (click both buttons sequentially)
- Both chips are visible in the composer
- Type "What do these two attachments have in common?"
- Click send
- Agent response acknowledges both attachments
- Try to attach a file over 10 MB via the paperclip
- Verify
onUploadFailedfires (console warning from the page) and the file is rejected without corrupting the composer - Try to attach an unsupported type (e.g.
.exe) — the file picker filter excludes it oronUploadFailedfires withreason: "invalid-type" - Block
/demo-files/sample.pngin DevTools Network; click "Try with sample image"; themultimodal-sample-errorspan shows a helpful error without crashing the page
- Attachment chip appears within 2 seconds after upload or sample injection
- Thumbnail renders for image attachments; document icon for PDFs
- Agent response arrives within 60 seconds for single-attachment prompts (multimodal tokens are heavier than text-only)
- No console errors during successful flows (warnings from
onUploadFailedduring intentional error cases are acceptable) - Error states are visible and recoverable — the user can retry