4 New Gemini Features You Can’t Afford to Miss


How many of you have wished AI could do more than just answer questions? I know I have, and as of late, I’m amazed by how it’s transforming. AI chatbots aren’t just about chatting anymore, they’re about creating, researching, and even listening to information in a whole new way. At the heart of this transformation is Google Gemini, an advanced AI assistant designed to streamline how we work with text, code, and audio. Whether you’re a developer, researcher, or content creator, Gemini introduces powerful tools like Canvas, Audio Overview, and Deep Research, making it easier than ever to write, code, and absorb complex information. In this article, we’ll be exploring all of these new features and even try out Gemini’s new Personalization Experimental Mode.

Latest Gemini Features

Gemini has recently introduced innovative updates designed to enhance productivity and creativity like never before. These new features make it easier to create, collaborate, and consume information efficiently. Here’s what’s new:

  • Canvas: Gemini’s Canvas is a real‑time workspace for drafting and editing both documents and code, featuring live previews and simple export options.
  • Audio Overview: Turn long documents and reports into engaging, podcast-style audio summaries for easy, on‑the‑go consumption.
  • Deep Research Integration: Gemini now offers comprehensive, AI‑powered research reports on any topic, providing detailed written insights that help you explore complex subjects from multiple perspectives.
  • Personalized Experimental Mode: Customize Gemini to your unique preferences by leveraging your Google data (like Search, YouTube, Maps, and Photos) so that its responses evolve with your habits, becoming a truly personal digital companion.

With these powerful new features, Gemini is no longer just a chatbot, it’s a complete productivity suite. Whether you’re writing, coding, researching, or learning, these advancements make creativity and efficiency more accessible than ever.

Now let’s explore these advancements one by one. First, let’s start with Canvas in Gemini.

Gemini Canvas

Canvas is Gemini’s dedicated creative space designed for both writers and developers. It’s a dynamic, real-time workspace where you can:

  • Draft and Edit Documents: Generate first drafts using AI assistance, then fine-tune your content by adjusting tone, length, or formatting, all while watching your changes appear instantly.
  • Develop and Preview Code: For developers, Canvas isn’t just a text editor. It includes features to generate code (HTML, React, and more) and provides live previews so you can see how your prototype or app functions as you iterate.
  • Collaborate Seamlessly: Whether you’re working solo or with a team, Canvas makes it easy to export your polished work like sending your document directly to Google Docs facilitating smooth collaboration and sharing.

How to Access Canvas in Gemini

Canvas is designed to be easily accessible on the web, mobile phone, and directly via the URL. Here’s how to do it:

1. Via the Gemini Web App:

  • Open Gemini on your desktop at gemini.google.com.
  • Locate the “Canvas” button in the prompt bar and click it to launch the workspace.

2. On Mobile Devices:

While mobile functionality is available, note that full editing tools for text styling and formatting are optimized for the desktop experience.

3. Direct URL Access:

Alternatively, you can jump directly to your Canvas projects by visiting the website.

This streamlined approach ensures that you can dive into your creative projects quickly and efficiently, no matter where you are.

What Can You Do with Canvas in Gemini?

Canvas opens up a world of possibilities for various users:

  • For Writers and Content Creators:
    • Generate drafts, refine essays, and create engaging reports or blog posts.
    • Use AI-powered tools to adjust your text’s tone, length, and formatting on the fly.
  • For Developers:
    • Create working prototypes of web apps or scripts, generate HTML or React code, and see live previews of your designs.
    • Iteratively improve your code with real-time feedback from Gemini.
  • For Educators and Students:
    • Develop study guides, presentations, and collaborative projects that can be seamlessly exported to Google Docs for group work.

The intuitive design of Canvas makes it a versatile tool, perfectly suited for rapid prototyping, creative writing, and interactive coding projects all within one cohesive platform.

Now let’s use Canvas. Here are the 3 tasks I will be performing to explore Gemini Canvas:

  1. Writing and Editing Content
  2. Writing and Editing Code
  3. Data Analysis and Visualization

Task 1: Writing and Editing an Article on Gemini Canvas

First, let’s try generating an article on Gemini and editing it using Canvas. Here, I’m going to prompt the chatbot to “write an article on AI”.

From the video, we can see that Gemini has written the article and on the right side, it opens up the Canvas window, offering various editing options such as selecting heading, bold, italic, bullet points, copy, share, and even export to docs. On the bottom right, it also gives us the option of shortening the length of the article or changing its tone.

Task 2: Writing and Editing Code on Gemini Canvas

Now let’s try out how coding works on Gemini’s Canvas. Here’s the prompt I used for this:

“Create a simple Dynamic Color Palette Generator using JavaScript, HTML, and CSS. The generator should have a button that, when clicked, randomly changes the background color and updates a set of four color swatches. Use CSS variables to dynamically update the colors. Also, display the hex codes of the generated colors below each swatch. The UI should be minimal and visually appealing.”

And here’s the response:

From the video, we can see that a color palette has been generated, and the colors change as we click on the ‘Generate Palette’ button. The generated swatches update dynamically, and the hex codes are displayed below each swatch. The Canvas editor allowed for easy modifications and real-time previews, enhancing the development process.

Task 3: Data Analysis on Gemini Canvas

For our last task, let’s give Gemini some data analysis to do. We’ll ask the chatbot to analyse the input data and plot a graph based on the analysis.

Prompt: “Analyse the benchmark in the provided PDF and plot a graph for the analysis”

Input file: https://arxiv.org/pdf/2408.00118

Here’s the response from Gemini:

From the video, we see that Gemini did the analysis, but did not provide any graph in the first attempt. So, I had to prompt it again to get an interactive graph of the analysis made. The final graph provided clear visual insights, and interactive features making it easier to interpret the data effectively.

Gemini Canvas vs ChatGPT Canvas Comparison

OpenAI had already introduced the Canvas feature on ChatGPT last year, providing users with an interactive workspace for text and document editing. It offers a similar interface and functionality, allowing seamless collaboration and AI-assisted content generation, coding, and document editing.

While both Gemini and ChatGPT have embraced the Canvas concept, there are notable differences in their approach and functionality:

Feature Gemini Canvas ChatGPT Canvas 
Integration Seamlessly integrated with Google’s ecosystem, allowing for easy export to Google Docs and smooth collaboration. Focuses on document creation within the ChatGPT environment; not directly integrated with Google services.
Features Supports both document editing and live code previews, making it a versatile workspace for writers and developers alike. Primarily focused on text-based collaboration, offering features like suggesting edits and adjusting text complexity.
Real-Time Feedback Built with real-time interactivity, providing instant changes and AI-powered suggestions as you work. Uses a split-view interface, with a conversational pane for collaboration and AI-generated suggestions.
Target Audience Available to Gemini users across different plans, integrated deeply with Google’s AI ecosystem. Currently available only for ChatGPT Enterprise, Pro, and Plus users, catering mainly to professional and business-oriented workflows.
Productivity Tools Includes live previews for code, AI-driven writing suggestions, and easy document exports, making it useful for both coding and content creation. Offers editing assistance, text adjustments, and version history tracking, but lacks extensive live code preview functions found in Gemini Canvas.

Also Read: Let’s Try Coding with OpenAI Canvas

Audio Overview on Gemini: Turn Documents into Podcasts

Now that we have covered the Canvas feature, let’s talk about how we can turn documents into podcasts using Google Gemini’s new Audio Overview.

Audio Overview is an innovative feature in Gemini that transforms static documents, slides, and reports into engaging podcast-style audio summaries. Instead of reading through lengthy content, you can simply listen to a dynamic, well-curated audio version, making it much easier to absorb complex information on the go. This tool is perfect for busy professionals and students alike, as it turns dense material into an accessible, hands-free learning experience.

Let’s go through the steps to use this feature effectively:

Step 1: Access Gemini
Open the Gemini app on your desktop by visiting gemini.google.com.

Step 2: Load Your Document
Either upload the document (PDF, slides, or text file) or copy and paste your content into the workspace.

Step 3: Select Audio Overview
Select “Generate Audio Overview” option in the prompt bar or as a suggestion chip above your content.

Note that this feature works only for Gemini 2.0 Flash and Gemini 2.0 Flash Thinking so select the model accordingly.

Gemini Audio Overview

Also Read: How to Access Google Gemini 2.0 Models for Free?

Step 4: Generate the Audio Summary
Now let Gemini process your document. The AI will convert the text into a podcast-style audio summary.

Gemini Audio Overview 1

Step 5: Download or Share
Download the audio file or share it directly from the platform.

Gemini Audio Overview 2

These steps should help you seamlessly convert static documents into engaging audio content using Gemini’s Audio Overview feature.

Here’s a video sowing how you can use Gemini’s Audio Overview feature, below which you can listen to the audio that Gemini generated.

As you have just heard, Gemini’s Audio Overview sounds just like a podcast. It was quite surprising that although I didn’t mention anything specific in my prompt, Gemini went on to generate a lively conversation between two people, discussing the topic of the pdf, making it really engaging!

Deep Research Integration

Deep Research, previously available as a specific model named ‘Gemini 1.5 Pro with Deep Research’ has now been integrated directly into the Gemini interface. Users can access this feature by selecting the ‘Deep Research’ button in the prompt bar or through the model picker dropdown, enabling them to utilize advanced models for comprehensive research tasks.

This integration allows Gemini to break down complex queries into actionable research steps, gathering data from diverse sources to generate comprehensive, multi-page reports. Whether you’re a student, researcher, or professional, this tool streamlines your research process by providing thorough, accurate findings. Additionally, it can convert these reports into engaging audio summaries, offering a versatile learning experience.

Here’s a step-by-step guide to using Gemini’s Deep Research Integration for AI-powered insights:

Step 1: Open Gemini and Select Deep Research Mode
Launch the Gemini app on your desktop (e.g., via gemini.google.com). Look for the “Deep Research” option in the prompt bar or menu.

Google chatbot models

Step 2: Enter Your Research Query
Type in the topic or question you want to explore. Be as detailed as possible to help Gemini break down your query into actionable research steps.

Gemini Deep Research
Gemini Deep Research 1

You can see in the image that Gemini gives you the option of editing the plan if required, before starting the research.

Step 3: Let Gemini Compile Information
When we click on start research Gemini will process the query, gather data from its vast sources, and generate a comprehensive research report. This report will include detailed insights, analysis, and relevant context.

Gemini Deep Research 2

Step 4: Review and Refine the Content
Once the content is generated, read through it to ensure it covers all the aspects you need. You can ask Gemini for clarifications or additional details on any section.

In the video, you can see that the article has been written properly, along with the links to all the source websites listed at the end.

Now let’s move on to the one remaining feature that’s new to Gemini i.e. Personalized Experimental.

Personalized Experimental in Gemini

Imagine having an AI assistant that truly understands you – not just answering your questions but adapting to your preferences, interests, and workflow. That’s exactly what Gemini’s Personalized Experimental Mode aims to achieve.

When enabled, this feature allows Gemini to tap into your Google ecosystem including Search, YouTube, Maps, and Photos to provide responses that feel more context-aware and relevant to you. Instead of generic answers, Gemini refines its suggestions based on your habits, making it an AI assistant that evolves alongside you.

For example, if you frequently search for coding tutorials, Gemini might adjust its responses to provide more detailed explanations or suggest relevant projects. If you’re a traveler, it could offer personalized trip recommendations based on your past searches and locations. Even the tone and depth of its responses can be customized to match your preferred style.

You can access the Personalized Experiment from here:

Google chatbot models

And here’s a video of the response generated by Gemini’s Personalization model when I asked it to “Generate a travel itinerary for my next 3-day weekend trip to Goa based on my past travel interests.”

While this feature is still experimental, Google is actively refining it to balance personalization with strong privacy controls.

With this shift toward adaptive AI, Gemini is growing from just an assistant, to becoming a personalized digital companion that learns, evolves, and enhances your productivity over time.

Conclusion

In conclusion, Gemini is no longer just an AI chatbot, it has evolved into a powerful productivity suite that enhances writing, coding, research, and personalization. With features like Canvas for real-time editing, Audio Overview for transforming text into audio, and Deep Research for in-depth insights, Gemini streamlines complex tasks and makes AI more interactive and efficient.

The addition of Personalized Experimental Mode further customizes the experience, adapting to individual user preferences. As AI continues to advance, Gemini stands at the forefront of this transformation, redefining how we create, collaborate, and consume information. Its seamless integration with Google’s ecosystem makes it an essential tool for anyone looking to boost efficiency and creativity in their daily workflow.

Frequently Asked Questions

Q1. What is Gemini, and how is it different from other AI models?

A. Gemini is Google’s advanced multimodal AI assistant, capable of handling text, code, images, and audio. Unlike traditional chatbots, it includes powerful tools like Canvas, Audio Overview, and Deep Research to enhance productivity and creativity.

Q2. How do I access Canvas in Gemini?

A. You can access Canvas through the Gemini web app by visiting gemini.google.com and selecting the Canvas option in the prompt bar. It provides a real-time interactive workspace for writing and coding.

Q3. What is Audio Overview, and how does it work?

A. Audio Overview allows you to convert documents, slides, and research reports into podcast-style audio summaries. This feature helps you absorb complex information on the go by transforming static text into engaging spoken content.

Q4. How does Gemini’s Deep Research Integration help with research?

A. Deep Research gathers in-depth insights from multiple sources and generates comprehensive research reports. You can also convert these reports into Audio Overviews for an immersive learning experience.

Q5. What is Personalized Experimental Mode in Gemini?

A. This experimental feature enables Gemini to personalize responses based on your Google ecosystem (Search, YouTube, Maps, etc.), making recommendations and generating content that aligns with your preferences.

Q6. Is Gemini Canvas better than ChatGPT’s Canvas?

A. Both have unique strengths. Gemini Canvas is deeply integrated with Google’s ecosystem, supporting live previews for coding and seamless document exports. ChatGPT’s Canvas is more focused on text-based collaboration and is currently available only to enterprise users.

Q7. Can I use Gemini for coding projects?

A. Yes! With Canvas, you can write and preview HTML, JavaScript, and React code in real-time. Gemini can also help you debug, refactor, and enhance your code dynamically.

Hi, I am Janvi, a passionate data science enthusiast currently working at Analytics Vidhya. My journey into the world of data began with a deep curiosity about how we can extract meaningful insights from complex datasets.

Login to continue reading and enjoy expert-curated content.

We will be happy to hear your thoughts

Leave a reply

Som2ny Network
Logo
Compare items
  • Total (0)
Compare
0