登录查看更多内容

Streamlined AI/ML Image Processing with Cloud Functions and Cloud Vision API: Automating Image Annotation at Scale

Anil Kumar

All 11x Google Certified Professional | Multi-Cloud Architect at IBM

发布日期: 2024年10月20日

Introduction:

In the world of artificial intelligence and machine learning (AI/ML), processing and classifying images is a common task for applications in e-commerce, security, and even content moderation. Google Cloud offers a powerful solution to automate image annotation and classification using serverless technologies like Cloud Functions and Cloud Vision API.

This blog will walk you through how to implement a serverless, highly scalable solution for image recognition and classification. We’ll deploy Cloud Functions to handle both individual image uploads via an HTTP-triggered REST API and batch image uploads through Cloud Storage. With Cloud Vision API, we’ll automate the process of labeling and annotating images. The annotations will be stored back in Cloud Storage as JSON, ready for further analysis or downstream consumption.

Overview of the Workflow:

REST API:
Batch Upload:

Step-by-Step Guide

Step 1: Enable Necessary APIs

Before we start coding, make sure you have enabled the required APIs in your Google Cloud project:

gcloud services enable cloudfunctions.googleapis.com vision.googleapis.com storage.googleapis.com

Step 2: Create a Cloud Storage Bucket

We’ll use Cloud Storage for batch image uploads, so create a bucket to store your images.

gsutil mb gs://your-bucket-name

Step 3: Write the Cloud Function to Handle Image Processing

The following Python code defines a Cloud Function that handles both the REST API calls for single image annotations and batch image uploads triggered by Cloud Storage events. It interacts with the Cloud Vision API to annotate the images.

cloud_function_main.py

import os
from google.cloud import vision
from google.cloud import storage
from flask import Flask, request, jsonify

app = Flask(__name__)
vision_client = vision.ImageAnnotatorClient()
storage_client = storage.Client()

def annotate_image(image_content, features=None):
    image = vision.Image(content=image_content)
    
    # Use default feature (label detection) if no specific features are provided
    response = vision_client.annotate_image({
        'image': image,
        'features': features or [{'type': vision.Feature.Type.LABEL_DETECTION}]
    })
    
    return response

# REST API to process single image
@app.route('/process-image', methods=['POST'])
def process_image():
    data = request.get_json()
    image_url = data.get('image_url')

    if not image_url:
        return jsonify({"error": "Image URL not provided"}), 400

    bucket_name, file_name = parse_gcs_url(image_url)
    bucket = storage_client.bucket(bucket_name)
    blob = bucket.blob(file_name)
    image_content = blob.download_as_bytes()

    # Call the annotation function
    features = parse_features(data.get('features'))
    response = annotate_image(image_content, features)

    return jsonify(response)

def parse_gcs_url(gcs_url):
    """Parses a Cloud Storage URL into bucket and file name."""
    gcs_url = gcs_url.replace("gs://", "")
    bucket_name, file_name = gcs_url.split("/", 1)
    return bucket_name, file_name

def parse_features(features_list):
    """Parses and returns specific features for Cloud Vision API."""
    if not features_list:
        return None
    
    features = []
    for feature_name in features_list:
        if feature_name == 'LABEL_DETECTION':
            features.append({'type': vision.Feature.Type.LABEL_DETECTION})
        elif feature_name == 'TEXT_DETECTION':
            features.append({'type': vision.Feature.Type.TEXT_DETECTION})
        # Add more features as needed
    return features

# Cloud Function triggered by Cloud Storage for batch processing
def process_image_from_storage(event, context):
    """Triggered by Cloud Storage when a new file is uploaded."""
    bucket_name = event['bucket']
    file_name = event['name']

    # Download the image from Cloud Storage
    bucket = storage_client.bucket(bucket_name)
    blob = bucket.blob(file_name)
    image_content = blob.download_as_bytes()

    # Annotate the image using Cloud Vision
    response = annotate_image(image_content)

    # Store the annotation as a JSON file in Cloud Storage
    annotation_blob = bucket.blob(f"annotations/{file_name}.json")
    annotation_blob.upload_from_string(response)

    print(f"Processed and stored annotations for {file_name}")

if __name__ == '__main__':
    app.run(host='0.0.0.0', port=8080)

Step 4: Deploy the Cloud Functions

Deploy the REST API Cloud Function for individual image processing:

领英推荐

New Era of Datascience in the Cloud

Michael Spencer 3 年前

AWS Summit New York: AWS Bolsters Its Generative AI…

Shelly DeMotte Kramer 8 个月前

The Future of MLOps: Strategies for Scalable AI in the…

Steven Murhula 1 个月前

gcloud functions deploy process-image \
--runtime python310 \
--trigger-http \
--allow-unauthenticated

2. Deploy the Cloud Function for batch image processing, triggered by Cloud Storage uploads:

gcloud functions deploy process_image_from_storage \
--runtime python310 \
--trigger-resource=your-bucket-name \
--trigger-event=google.storage.object.finalize

Step 5: Test the REST API

Once your Cloud Functions are deployed, you can test the REST API by making a POST request. You can pass the Cloud Storage URL of the image you want to process, along with optional annotation features (like label detection or text detection).

curl -X POST https://REGION-PROJECT_ID.cloudfunctions.net/process-image \
-H "Content-Type: application/json" \
-d '{
  "image_url": "gs://your-bucket-name/your-image.jpg",
  "features": ["LABEL_DETECTION", "TEXT_DETECTION"]
}'

This will return a JSON response containing the image annotations.

Step 6: Test Batch Processing via Cloud Storage

To test the batch processing functionality, simply upload an image to the Cloud Storage bucket.

gsutil cp your-image.jpg gs://your-bucket-name/

The Cloud Function will be triggered automatically, and it will annotate the image using Cloud Vision. The annotations will be saved in the annotations/ folder in the same Cloud Storage bucket as a JSON file.

Terraform Option: Automate Deployment

You can also automate the deployment of this entire solution using Terraform. Google Cloud offers a Jump Start solution that you can directly deploy through the console or via GitHub.

Jump Start Solution Repo: https://github.com/GoogleCloudPlatform/terraform-ml-image-annotation-gcf/tree/sic-jss/infra

To deploy via Terraform:

git clone https://github.com/GoogleCloudPlatform/terraform-ml-image-annotation-gcf.git
cd terraform-ml-image-annotation-gcf/infra
terraform init
terraform apply

Conclusion

By integrating Cloud Functions and the Cloud Vision API, we’ve built a highly scalable, serverless solution for image recognition and annotation. Whether you're handling individual image uploads via a REST API or batch uploads through Cloud Storage, this architecture allows you to automate the entire process with minimal setup. Additionally, the use of Terraform makes deployment quick and efficient, allowing you to focus on building solutions instead of infrastructure management.

This solution is ideal for e-commerce platforms, content moderation systems, or any use case that requires scalable image processing with AI/ML.

要查看或添加评论，请登录

Anil Kumar的更多文章

Revolutionizing Cloud AI with SandboxAQ: A Hands-on Guide to Large Quantitative Models on Google Cloud

2025年1月31日

Revolutionizing Cloud AI with SandboxAQ: A Hands-on Guide to Large Quantitative Models on Google Cloud

Introduction Google Cloud has been at the forefront of cloud computing innovation, constantly enhancing its ecosystem…
Elevating Security in Google Cloud: 7 Best Practices for Modern Workloads

2024年12月2日

Elevating Security in Google Cloud: 7 Best Practices for Modern Workloads

As organizations accelerate their migration to the cloud, securing workloads on platforms like Google Cloud Platform…
Master Google Cloud Databases: Comparing Transactional and Analytical Workloads with Firestore and BigQuery

2024年11月30日

Master Google Cloud Databases: Comparing Transactional and Analytical Workloads with Firestore and BigQuery

As businesses grow and generate vast amounts of data, selecting the right database for your workload is critical. The…
Comprehensive Guide to Google Cloud Databases: Choosing the Right Option for Your Application

2024年11月26日

Comprehensive Guide to Google Cloud Databases: Choosing the Right Option for Your Application

In the modern world of cloud computing, databases are the backbone of any application. Google Cloud, a leader in cloud…
Valuable lesson about the dangers of overconfidence and the importance of preparation, no matter how experienced you are.

2024年8月9日

Valuable lesson about the dangers of overconfidence and the importance of preparation, no matter how experienced you are.

In July 2024, I walked into the exam room with a sense of confidence that bordered on overconfidence. I had taken the…
Why People Aren't Happy in Their IT Professional Life (And What Can Help)

2024年5月8日

Why People Aren't Happy in Their IT Professional Life (And What Can Help)

Investigates the causes of dissatisfaction within the IT industry. The article highlights issues like burnout…
Embeddings and Vector Search for Google Cloud Professionals: A Technical Deep Dive

2024年5月8日

Embeddings and Vector Search for Google Cloud Professionals: A Technical Deep Dive

Introduction In the realm of machine learning and natural language processing (NLP), embeddings and vector search have…
The Ultimate Guide to Acing the Google Cloud Professional Architect Certification

2024年5月7日

The Ultimate Guide to Acing the Google Cloud Professional Architect Certification

Introduction In the booming world of cloud computing, Google Cloud Platform (GCP) is a force to be reckoned with. And…
Navigating the Evolution: Amazon AWS AI Journey

2024年4月7日

Navigating the Evolution: Amazon AWS AI Journey

In the ever-evolving landscape of artificial intelligence (AI), Amazon Web Services (AWS) has emerged as a key player…
Google Cloud Professional Machine Learning Engineer Certification: Your Path to Success with Udemy's Winner Series

2024年2月24日

Google Cloud Professional Machine Learning Engineer Certification: Your Path to Success with Udemy's Winner Series

Google Professional ML Certification Exam Passing Guide Earning the Google Cloud Professional Machine Learning (ML)…

3 条评论

See all articles

Streamlined AI/ML Image Processing with Cloud Functions and Cloud Vision API: Automating Image Annotation at Scale

Anil Kumar

All 11x Google Certified Professional | Multi-Cloud Architect at IBM

Introduction:

Overview of the Workflow:

Step-by-Step Guide

Step 1: Enable Necessary APIs

Step 2: Create a Cloud Storage Bucket

Step 3: Write the Cloud Function to Handle Image Processing

Step 4: Deploy the Cloud Functions

领英推荐

Step 5: Test the REST API

Step 6: Test Batch Processing via Cloud Storage

Terraform Option: Automate Deployment

Conclusion

Anil Kumar的更多文章

社区洞察

其他会员也浏览了

Model Deployment Techniques for Machine Learning Models

AWS re:Invent 2024 – AI, Analytics, Silicon, Storage and Data Observability

Estafet Insights - Edition 9

AWS update of Week 30 (24Jul - 30Jul)

Unlocking the Power of Generative AI with AWS Services

Gen AI Services on AWS: A Three-Layered Approach

Embracing the Future of AI and Cloud Innovation

Develop and Deploy Generative AI Applications on AWS with Eviden’s GenOps Framework - Part 3

Amazon Textract vs. Azure AI Document Intelligence vs. Google Cloud Document AI: My Hands-On Comparison

Introduction:

Overview of the Workflow:

Step-by-Step Guide

Step 1: Enable Necessary APIs

Step 2: Create a Cloud Storage Bucket

Step 3: Write the Cloud Function to Handle Image Processing

Step 4: Deploy the Cloud Functions

领英推荐

Step 5: Test the REST API

Step 6: Test Batch Processing via Cloud Storage

Terraform Option: Automate Deployment

Conclusion

Anil Kumar的更多文章

Revolutionizing Cloud AI with SandboxAQ: A Hands-on Guide to Large Quantitative Models on Google Cloud

Elevating Security in Google Cloud: 7 Best Practices for Modern Workloads

Master Google Cloud Databases: Comparing Transactional and Analytical Workloads with Firestore and BigQuery

Comprehensive Guide to Google Cloud Databases: Choosing the Right Option for Your Application

Valuable lesson about the dangers of overconfidence and the importance of preparation, no matter how experienced you are.

Why People Aren't Happy in Their IT Professional Life (And What Can Help)

Embeddings and Vector Search for Google Cloud Professionals: A Technical Deep Dive

The Ultimate Guide to Acing the Google Cloud Professional Architect Certification

Navigating the Evolution: Amazon AWS AI Journey

Google Cloud Professional Machine Learning Engineer Certification: Your Path to Success with Udemy's Winner Series

社区洞察

其他会员也浏览了

Model Deployment Techniques for Machine Learning Models

AWS re:Invent 2024 – AI, Analytics, Silicon, Storage and Data Observability

Estafet Insights - Edition 9

AWS update of Week 30 (24Jul - 30Jul)

Unlocking the Power of Generative AI with AWS Services

Gen AI Services on AWS: A Three-Layered Approach

Embracing the Future of AI and Cloud Innovation

Develop and Deploy Generative AI Applications on AWS with Eviden’s GenOps Framework - Part 3

Amazon Textract vs. Azure AI Document Intelligence vs. Google Cloud Document AI: My Hands-On Comparison