
Building Projects With Chrome's On-device AI

Parthipan N.

Full-stack Developer | JavaScript | TypeScript | Python | MERN | AWS | PostgreSQL

Published: December 9, 2024


Using the experimental Prompt API on Chrome to build prototypes with AI features

On-device / Edge AI

On-device AI refers to AI models that run directly on end-user devices, such as smartphones, tablets, or IoT gadgets, without relying on cloud computing or a server to host these models.

This is useful in many ways:

Since the model is on the device, we can run offline inferences.
We can reduce the operational costs of running AI features by offloading certain inferences to the client devices.
Since the data never leaves the device, we can offer more privacy and data security with on-device models.

However, because these models run on memory-constrained devices, they can’t handle the general-purpose inference that a cloud-hosted Large Language Model can. Instead, they are smaller models with specific capabilities.

Chrome ships with one such model. Let’s take a look at it:

Gemini Nano in Chrome

The latest version of Google Chrome ships with an on-device AI model, Gemini Nano. However, the APIs for interacting with it are experimental and behind a flag.

So if we intend to use the experimental API, we’ll first need to enable this feature flag through the following steps:

Update to the latest version of Chrome and then visit chrome://flags.
Search for Prompt API for Gemini Nano
Enable the flag
Restart the browser

Building Applications with Chrome’s On-device AI

Once the feature is enabled, we can access the model from a global object as follows:

window.ai
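Before creating a session, it can help to check whether the model is actually usable. The following is a minimal sketch, not the article's original code: `getModelAvailability` is a hypothetical helper, `ai` stands for `window.ai`, and the `capabilities()` call with its `available` values is an assumption about the experimental API, which may differ between Chrome versions.

```javascript
// Sketch only: `ai` stands for the experimental `window.ai` object.
// The `capabilities()` call and its `available` values are assumptions
// about the experimental API and may differ between Chrome versions.
async function getModelAvailability(ai) {
  // The flag is off, or the browser does not expose the API at all.
  if (!ai?.languageModel) {
    return "unsupported";
  }
  // Some builds may not expose capabilities(); report that as unknown.
  if (typeof ai.languageModel.capabilities !== "function") {
    return "unknown";
  }
  const { available } = await ai.languageModel.capabilities();
  // Assumed values: "readily", "after-download", or "no".
  return available;
}
```

In the browser, this could be called as `getModelAvailability(window.ai)`. Passing the object in as a parameter, rather than reading the global directly, keeps the helper easy to exercise with a stub.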

The Prompt API

We can create a session with a system prompt as follows:

const inferenceSession = await window.ai.languageModel.create({
  systemPrompt: `You are an English teacher. 
                 Analyse a given word and come up with a sentence to demonstrate the usage of the word.
                 Always respond in English.`
});

Once the inference session is created we can invoke the prompt method on it as follows:

await inferenceSession.prompt('Precarious');

Execution result of the Prompt API in Chrome
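For one-off prompts, the two calls above can be wrapped in a single helper that also releases the session afterwards. This is a sketch under assumptions: `askOnce` is a hypothetical name, `languageModel` stands for `window.ai.languageModel`, and `destroy()` is assumed to be available on the session, so the code checks for it before calling it.

```javascript
// Sketch only: `languageModel` stands for `window.ai.languageModel`.
// `destroy()` is assumed to exist on the session, so it is called defensively.
async function askOnce(languageModel, systemPrompt, word) {
  const session = await languageModel.create({ systemPrompt });
  try {
    return await session.prompt(word);
  } finally {
    // Release the session's resources even if prompt() throws.
    if (typeof session.destroy === "function") {
      session.destroy();
    }
  }
}
```

Creating a session per prompt is simple but discards the session between calls, so an app that prompts repeatedly would more likely keep one session around, as the sample project in this article does.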

A Sample Project

Let’s build the idea above into a simple web application. The system design for our project is shown below:

Our final product will be as follows:

To keep the article focused on the AI integration, let’s look only at how that part of the code is composed:

The link to the GitHub repository with the complete code is at the bottom of this article.

The AI Helper Methods

The module design we have for this utility can be depicted as shown in the image below:

We can implement the above with the following code:

// src/utils/ai.js

export async function setupAI() {
  // Fail early when the Prompt API flag is disabled or unsupported.
  if (!window.ai?.languageModel) {
    throw new Error("AI feature is not enabled on this browser.");
  }
  // The system prompt fixes the response format for the whole session.
  const inferenceSession = await window.ai.languageModel.create({
    systemPrompt: `You are an English teacher. For a given word, come up with a sentence to demonstrate the usage of the word.
    Always respond in English in the following format: 
    <h3>Usage:</h3> <p>Your sentence here</p>
    <h3>Meaning:</h3>  <p>The meaning of the word</p>
    `,
  });
  return inferenceSession;
}

// Sends a single word to the session and resolves with the model's reply.
export async function prompt(inferenceSession, word) {
  const response = await inferenceSession.prompt(word);
  return response;
}

Notice the system prompt, where we instruct the model to return the response as HTML elements. This is to simplify our application logic. If we were to deploy this app, it would be a good idea to sanitize and validate the response before injecting it into the DOM. Since this is just a proof of concept, we can skip that part in this context.
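As a rough illustration of what such sanitizing and validating could look like, here is a minimal sketch. It is not part of the demo project: `escapeHtml`, `extractSection`, and `toSafeHtml` are hypothetical helpers, and the approach assumes the model follows the `<h3>`/`<p>` format requested in the system prompt. Rather than trusting the returned markup, it extracts the two text fragments, escapes them, and rebuilds the HTML.

```javascript
// Escape characters that have special meaning in HTML.
function escapeHtml(text) {
  return text
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&#39;");
}

// Pull the text out of the first <p> that follows a given <h3> label.
// `label` is a fixed string chosen by us, so it is not regex-escaped here.
function extractSection(response, label) {
  const pattern = new RegExp(
    `<h3>\\s*${label}:\\s*</h3>\\s*<p>([\\s\\S]*?)</p>`,
    "i"
  );
  const match = response.match(pattern);
  return match ? match[1].trim() : null;
}

// Validate the model output and rebuild the markup from escaped text.
// Returns null when the response does not follow the expected format.
function toSafeHtml(response) {
  const usage = extractSection(response, "Usage");
  const meaning = extractSection(response, "Meaning");
  if (usage === null || meaning === null) {
    return null;
  }
  return (
    `<h3>Usage:</h3><p>${escapeHtml(usage)}</p>` +
    `<h3>Meaning:</h3><p>${escapeHtml(meaning)}</p>`
  );
}
```

A caller could show a fallback message whenever `toSafeHtml` returns `null`. For anything beyond a prototype, a maintained sanitizer library would likely be a safer choice than hand-written escaping.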

Setting Up the Inference Session on Content Load

The on-load control flow is as follows:

Which could be implemented with the following logic:

// main.js

import { setupAI } from "./src/utils/ai.js";

const initUI = () => {
  // ... code to initialize the user interface
};

let inferenceSession = null;

document.addEventListener("DOMContentLoaded", async () => {
  try{
    inferenceSession = await setupAI();
    initUI();
  }catch(error){
    console.error(error);
    alert("App failed to load. Please check the console for more details.");
  }
});

Prompting for Word Usage and Definition

The inference control flow can be visualized as below:

We could implement this logic as follows:

// main.js
import { setupAI, prompt } from "./src/utils/ai.js";

const initUI = () => {
  // ... existing code
  // `input`, `updateTitle`, `updateContent`, and `parseBold` are assumed to
  // be defined in the elided code.
  setupButtons(document.querySelector("#button-container"), {
    onSubmit: () => {
      const trimmedValue = input.value.trim();

      if (trimmedValue) {
        // Show the capitalized word as the title while the model responds.
        updateTitle(trimmedValue.charAt(0).toUpperCase() + trimmedValue.slice(1));
        updateContent(`
          <p>Asking the AI for the word usage instructions... Please wait...</p>
        `);

        prompt(inferenceSession, trimmedValue)
          .then((response) => {
            updateContent(`
              <div>${parseBold(response)}</div>
            `);
          })
          .catch((error) => {
            updateContent(`
              <p>Failed to get the usage instructions. Please try again.</p>
            `);
            console.error(error);
          });
      }
    },

    // ... existing code
  });
};

// ... existing code

Since this is only a proof of concept, we intentionally skipped validating the input when a user enters a word and clicks `Submit`.
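If we did want to validate the input, a minimal sketch could look like the following. `validateWord` is a hypothetical helper that is not in the demo repository, and the rules it applies (letters only, optional internal hyphens or apostrophes, at most 45 characters) are assumptions chosen for illustration.

```javascript
// Hypothetical helper: accept a single English word (letters, with optional
// internal hyphens or apostrophes) of at most 45 characters.
function validateWord(rawInput) {
  const value = String(rawInput ?? "").trim();
  if (value.length === 0) {
    return { ok: false, reason: "Please enter a word." };
  }
  if (value.length > 45) {
    return { ok: false, reason: "That word is too long." };
  }
  if (!/^[A-Za-z]+(?:['-][A-Za-z]+)*$/.test(value)) {
    return { ok: false, reason: "Please enter a single word using letters only." };
  }
  return { ok: true, value };
}
```

The submit handler could call this before prompting the model and display `reason` whenever `ok` is `false`.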

GitHub Repositories

The complete functional code for this demo can be accessed from this GitHub repository:

If you are interested in exploring a bit more sophisticated application built using this on-device AI model and Svelte, this hobby project of mine might interest you:

This article was originally published on my Medium blog.

Medium Blog link: https://levelup.gitconnected.com/building-projects-with-chromes-on-device-ai-4b66a5ec7fc1
