Changelog

Follow up on the latest improvements and updates.

RSS

This release focuses on bug fixes and other improvements
  • Fixed: chat streaming freezes or is very slow
  • Fixed:
    gpt
    inline trigger doesn't work if OpenAI is not the default service
  • Fixed: AI Command pallete doesn't stay on top
  • Fixed: not discovering
    beta
    updates
  • Attemp to fix the issue where BoltAI would crash when going back from background
  • Fixed the issue where prompt stuck in "continue mode"
BedrockSettings
  • Official supports for Amazon Bedrock & xAI
  • Set custom cache breakpoint when using Anthropic models
  • Convert temporary chat to normal chat
  • Added support for o1 models on Azure
  • Document Analysis now supports for more file types
  • Improved scroll-to-bottom behavior, faster list view
  • Improved drag-and-drop experience
  • Fixed the issue where control+n / control+p for non-standard keyboard layout
  • Fixed the issue where app icon is reverted after update
  • Fixed issue where cannot set up Advanced Voice Mode without OpenAI API key
  • Fixed issue with Web Search plugin for some Anthropic users
This release improves the AI Command experience
CleanShot 2024-11-27 at 20
  • Improved the AI command keyboard handling. You can now use control+n or control+p to navigate.
  • Improved input handling for Japanese input source
  • Improved AI Command performance, faster search & scrolling
  • Fixed an issue with toggling the AI Command window

new

fixed

v1.27.0

The Document Analysis feature just got improved.
CleanShot 2024-11-09 at 11
  • New:
    Claude's native PDF capabilities
  • New:
    BoltAI supports OCR now. Right-click on a PDF and select "Reprocess with OCR"
  • New:
    Choose where to attach documents in the chat (attach on individual messages or to the System Prompt)
  • Fixed
    : Fixed the issue where BoltAI would not open a new chat when the sidebar is collapsed
Claude's native PDF capabilities
If you're using Claude Sonnet 3.5, you can use the new native PDF capabilities in BoltAI. Find the option in the right sidebar (screenshot below).
Note that when native PDF capabilities option is enabled, Prompt Caching will be temporarily disabled.
CleanShot 2024-11-07 at 23
OCR support
For PDFs with images and tables, you can "reprocess" the document using OCR for better accuracy. Currently, BoltAI only support OCR for PDF documents.
CleanShot 2024-11-09 at 21
More fine-grain settings for Document Analysis feature
You can choose to attach document content to individual messages or to the System Prompt.
  • When attaching to the System Prompt, you can also choose to apply Prompt Caching automatically.
  • When attaching to each individual message, Prompt Caching is temporarily disabled.

new

improved

v1.26.0

CleanShot 2024-11-01 at 01
  • New: Gemini models support native Google Search Tool.
  • Advanced Voice Mode supports more voices and with caching
  • Custom keyboard shortcuts for Voice Input & Advanced Voice Mode
  • Improved: Better Voice Input UI & error handling
What's new?
Grounding with Google Search
BoltAI now supports the Grounding with Google Search feature for Gemini Models. To enable it, open the plugin list and find the "Google Search (Gemini Only)".
Note that it might be more expensive than using the current Web Search plugins provided by BoltAI ($35 per 1K grounding request).
BoltAI doesn't support
mode
and
dynamic_threshold
settings in this release yet.
New realtime voices
Great news. You can now more realtime voices. It's also cheaper now as OpenAI applied prompt caching automatically.
Enjoy!
CleanShot 2024-11-01 at 17
Better Voice Input UI
I've made a few of improvements to the Voice Input UI:
  • Support custom keyboard shortcut
  • Added the ability to cancel the Voice Input session
  • Retry if the transcription fails
CleanShot 2024-11-01 at 17
And that's it for now 👋
Happy belated halloween 🎃
  • New AI Command Behavior: Open in a new temporary window
  • New: New chat window is floating by default
  • New: Open chat messages in browser
What's New?
New Behavior: Open in a new temporary window
When using an AI Command, you can choose to open it in a new temporary window. No need to clean up inline chats aftereward.
You can bulk edit the command's behavior to this one.
CleanShot 2024-10-29 at 18
Floating Chat Windows
A single chat window is floating by default. You can access it in any Space.
Open chat messages in browser
If you have a custom markdown viewer, you can use this to view the message in your favorite web browser.
CleanShot 2024-10-29 at 18
* Thanks Adam, AR, Hans, Anıl and others for the feature suggestion.

improved

fixed

v1.25.2

CleanShot 2024-10-28 at 15
  • Improved: Setting spell check & auto-correct options in the chat input field
  • Fixed: Inline Whisper not working for some users
Advanced Voice Mode is here.
CleanShot 2024-10-25 at 01
  • New: Advanced Voice Mode using OpenAI's Realtime API.
  • New: New UI for the Voice Input
  • New: Read Aloud streams and plays audio in real time
What's new?
Advanced Voice mode
The long awaited Advanced Voice Mode (AVM) is released. You can now have a voice conversation with GPT-4o via Realtime API.
BoltAI supports both OpenAI and Azure OpenAI Service.
New UI for Voice Input
Voice Input UI just got an upgrade. Much better than the previous version, right?
CleanShot 2024-10-26 at 02
Better Read Aloud
The Read Aloud feature now streams and plays audio in real time. It's much faster and more robust.
And that's it for now. Have a wonderful weekend 👋
This release focuses on improving the voice input & Inline Whisper experience.
CleanShot 2024-10-16 at 20
  • New: Use Whisper from Groq with the new model "whisper-large-v3-turbo". Go to
    Settings > Advanced > Voice Settings
    to configure.
  • New: Added suppport for custom Whisper prompt
  • New: Whisper AI plugin now supports custom host & Organization ID
  • New (Inline Whisper): Added the option to copy transcription to clipboard
  • Improved: You can configure the keyboard shortcut for toggling the chat configuration pane (Toggle Inspector)
boltai-doc-analysis-o1
  • New: The o1 series models now support Document Analysis
  • New: Reworked the chat input field. Fixed layout issues when pasting long prompts.
  • New: Estimate token & cost for Gemini models
  • Improved: Keyboard shortcut for toggling chat configuration pane (Toggle Inspector:
    cmd+I
    )
  • Fixed: Cannot set up custom OpenAI-compatible server
  • Fixed: Cannot use emojis in chat titles
  • Fixed: Fixed an issue where the app could hang when loading user's profile picture
What's new?
This release adds some great improvement.
You can use o1 series models for Document Analysis now
.
The OpenAI's new o1 model doesn't support custom System Instruction. In this release, I put your custom system instruction and document content into the user prompt.
The chat input field got a huge upgrade.
I switched to AppKit's NSTextView and this fixes all the layout issues when pasting long prompts. It's also a lot faster too!
Estimate costs for Gemini models.
Gemini flash 8B is my new favorite small model. It's cheap and is quite capable for simple tasks such as summarization or grammar fixes...
In this release, I've added token & cost estimation for Gemini models.
CleanShot 2024-10-13 at 13
New BoltAI Workspace.
This is not related to mac app, but I've been working on the new BoltAI Workspace. It's the cloud version of BoltAI with Cloud Sync & other collaboration features.
It's not ready yet but you can join the waitlist here: https://app.youform.com/forms/kyxy1r54
CleanShot 2024-10-09 at 11
Have a nice weekend.
See you in the next update 👋
Load More