News from the AI & ML world

DeeperML - #aimemory

@Simon Willison's Weblog //
Google has released QAT (Quantization-Aware Training) optimized versions of its Gemma 3 models, aiming to make these powerful language models more accessible. By using quantization, the models are significantly reduced in size, allowing them to run on consumer-grade GPUs and even mobile devices. This approach dramatically reduces memory requirements while maintaining high quality, opening up possibilities for running large language models locally.

The key to this accessibility is Quantization-Aware Training (QAT), which simulates lower bit widths during training. This allows the model to adapt to these limits and minimize the performance drop typically associated with lower precision. Google reports significant model size reductions, with the Gemma 3 27B model dropping from 54GB to 14.1GB when quantized to int4 format. Similar reductions are seen across the Gemma 3 family, with the 12B model shrinking to 6.6GB, the 4B model to 2.6GB, and the 1B model to a mere 0.5GB.

Google partnered with Ollama, LM Studio, MLX, and llama.cpp to facilitate the use of these quantized models. Simon Willison reports that the "gemma3:27b-it-qat" model, requiring 22GB of RAM, has become a favorite local model on his Mac, accessible from his phone via Ollama, Open WebUI, and Tailscale. Willison also noted the snappily titled "Gemma-3-27b-it-qat-q4_0-gguf" sounds like a Wi-Fi password, but is in fact Google’s leanest LLM yet.

Recommended read:
References :
  • bsky.app: I think the snappily titled "gemma3:27b-it-qat" may be my new favorite local model - needs 22GB of RAM on my Mac (I'm running it via Ollama, Open WebUI and Tailscale so I can access it from my phone too) and so far it seems extremely capable
  • Simon Willison's Weblog: Gemma 3 QAT Models
  • the-decoder.com: Gemma-3-27b-it-qat-q4_0-gguf sounds like a Wi-Fi password but it’s Google’s leanest LLM yet

@the-decoder.com //
OpenAI has launched a significant update to ChatGPT, enhancing its memory capabilities to include the entire history of user conversations. This allows the AI model to draw on past interactions, providing more personalized and relevant responses. The upgrade is designed to make ChatGPT a more adaptable and long-term tool, evolving with users and understanding their preferences over time, across various modalities including text, voice, and image interactions.

ChatGPT Plus and Pro users will be the first to access the new memory feature, with rollout planned for Team, Enterprise, and Edu accounts in the coming weeks. The improved memory system includes two key components: "Reference saved memories," where users can explicitly direct ChatGPT to remember specific facts like names or preferences, and "Reference chat history," which allows the model to use context from prior conversations to adapt to a user's tone, goals, and interests. Users retain control over their information and can choose to disable the memory function entirely or set limitations on how ChatGPT references previous conversations.

With the update, OpenAI aims to create a more seamless and context-aware experience for users. ChatGPT can now draw more naturally on past conversations, even in new chats, leading to more helpful and personalized responses. This enhancement positions ChatGPT alongside other digital assistants, moving towards more versatile companions powered by generative AI. The rollout excludes countries in the European Economic Area, the UK, Switzerland, Norway, Iceland, and Liechtenstein.

Recommended read:
References :
  • the-decoder.com: OpenAI expands ChatGPT's memory to include full conversation history
  • AI News | VentureBeat: ChatGPT’s memory can now reference all past conversations, not just what you tell it to
  • www.tomsguide.com: ChatGPT just got a huge memory upgrade — here's why it's a big deal
  • Maginative: ChatGPT can now remember your past conversations
  • Search Engine Journal: ChatGPT Memory Update: Remembers Info Across All Chats
  • THE DECODER: OpenAI expands ChatGPT's memory to include full conversation history
  • www.theguardian.com: OpenAI countersues Elon Musk over ‘unlawful harassment’ of company
  • PCMag Middle East ai: ChatGPT Has Receipts, Will Now Remember Everything You've Ever Told It
  • analyticsindiamag.com: Now ChatGPT Knows What You Told It Last Summer
  • www.zdnet.com: ChatGPT will remember everything you tell it now - like a real personal assistant
  • AI News: OpenAI counter-sues Elon Musk for attempts to ‘take down’ AI rival
  • shellypalmer.com: Until now, ChatGPT’s "Memory" feature could retain a handful of user-provided facts to personalize responses. Yesterday, OpenAI announced a new feature you will either dearly love or truly hate: ChatGPT can now reference your entire chat history across every conversation you've ever had with it — not just a few saved facts.
  • Shelly Palmer: ChatGPT's new memory feature allows it to reference the entire conversation history across all conversations.
  • thetechbasic.com: ChatGPT Now Remembers All Your Chats With Major Memory Upgrade
  • The Tech Portal: ChatGPT Will Now Remember Past Conversations
  • The Tech Basic: ChatGPT has the ability to recall absolutely everything you discuss with it. ChatGPT performs this capability in its current state.
  • Digital Information World: OpenAI just a major expansion to its customization and memory feature for ChatGPT. For many users, this means now being able to recall data from past conversations and also adjusting replies depending on that data.

Alexey Shabanov@TestingCatalog //
OpenAI is developing new features for ChatGPT, including "Moonshine" memory, which allows the AI to recall past conversations for more context-aware interactions. A notification feed is also in the works, designed to keep users informed about new features and announcements. The company is also working on a "Whisper" button on the web app, enabling voice dictation, a feature already available on mobile and desktop versions.

Another upcoming feature is a "reasoning slider," allowing users to control the effort the model puts into completing tasks. Options like "think a little" or "think harder" will simplify choices for users unfamiliar with technical model differences. While these features are not yet fully available, some users have reported early access to the Whisper button and Moonshine memory.

The company announced GPT-4o's new image generation capabilities, intending to replace DALL-E as the default image generation model. Due to high demand from subscribers, OpenAI has delayed the rollout of the GPT-4o image generation feature to free users. The feature, which allows for more precise and accurate image creation, was enthusiastically adopted by subscribers, prompting the delay for wider availability.

Recommended read:
References :
  • Data Phoenix: OpenAI delays the new GPT-4o image generation feature for free users due to high demand
  • TestingCatalog: OpenAI prepares reasoning slider and memory update for ChatGPT users

@the-decoder.com //
Google's Gemini AI now possesses the ability to remember past conversations, allowing for more personalized and context-aware interactions. This feature is currently available to paying subscribers and enables Gemini to recall user preferences and past discussions, enhancing its capacity to provide relevant and coherent responses. Users can now ask Gemini to provide a summary of past discussions, eliminating the need to start from scratch or search for previous threads.

The new memory feature, which extends beyond simply remembering preferences to recalling entire conversations, is currently available in English for Gemini Advanced subscribers on the web and mobile. Google says that users must “check responses for accuracy.” Users have the option to review, delete, or adjust how long Gemini retains their chats, and can also disable all Gemini app activities through the MyActivity tab. In the coming weeks, it will be expanded to more languages and Google Workspace Business and Enterprise customers.

Recommended read:
References :
  • PCMag Middle East ai: Now, Gemini recalls not just your preferences but entire conversations. Google's Gemini AI can now recall past conversations, meaning you can ask Gemini to provide a summary of your past discussions on a topic.
  • techstrong.ai: Google to Spend $75 Billion on AI, Cloud Investment
  • the-decoder.com: Google's Gemini chatbot can now recall previous conversations, but only for paying subscribers. The feature aims to make interactions more personal by remembering user preferences and context over time.
  • shellypalmer.com: It’s Valentine’s Day—did you forget the chocolates? Fear not, because if you’re a Google Gemini Advanced user, forgetting is yesterday's problem.
  • THE DECODER: Google's Gemini chatbot can now recall previous conversations, but only for paying subscribers. The feature aims to make interactions more personal by remembering user preferences and context over time.
  • www.ghacks.net: Gemini Advanced Introduces Chat Recall Feature — Here's What It Means for You
  • Google Workspace Updates: Gemini Deep Research and experimental models now available to Google Workspace users in Gemini Advanced