Google Cloud Vision API batch image annotator

I have a directory with almost 200 images, and I want to get the image properties for each of them using the google-cloud-vision API.

I am having trouble making a batch request for those images. The first code block below is a tutorial from the official site, but it uses gs:// as the source for its images.

I want to use my local directory, but can't figure out how to do it. The second code block is what I have written. I tried to load the images with an imageLoader, but that failed.

Please help!

from google.cloud import vision_v1

def sample_async_batch_annotate_images(input_image_uri, output_uri):
    """Perform async batch image annotation."""
    client = vision_v1.ImageAnnotatorClient()

    source = {"image_uri": input_image_uri}
    image = {"source": source}
    features = [
        {"type_": vision_v1.Feature.Type.LABEL_DETECTION},
        {"type_": vision_v1.Feature.Type.IMAGE_PROPERTIES},
    ]

    # Each requests element corresponds to a single image.  To annotate more
    # images, create a request element for each image and add it to
    # the array of requests
    requests = [{"image": image, "features": features}]
    gcs_destination = {"uri": output_uri}

    # The max number of responses to output in each JSON file
    batch_size = 2
    output_config = {"gcs_destination": gcs_destination,
                     "batch_size": batch_size}

    operation = client.async_batch_annotate_images(requests=requests, output_config=output_config)

    print("Waiting for operation to complete...")
    response = operation.result(90)

    # The output is written to GCS with the provided output_uri as prefix
    gcs_output_uri = response.output_config.gcs_destination.uri
    print("Output written to GCS with prefix: {}".format(gcs_output_uri))

I tried simply changing the path to a local directory, but got an error saying it must be a gs:// URI. Then I started loading the images into imgs with an imageLoader, but got this error: TypeError: Cannot set google.cloud.vision.v1.ImageSource.image_uri to [<PIL.JpegImagePlugin.JpegImageFile image mode=RGB size=960x817 at 0x1237E0CD0

import io
import os

from google.cloud import vision_v1

from os import listdir
from PIL import Image as PImage

os.environ["GOOGLE_APPLICATION_CREDENTIALS"] = r"/Users/example/Documents/Project/ServiceAccountTolken.json"

def loadImages(path):
    # return array of images

    imagesList = listdir(path)
    loadedImages = []
    for image in imagesList:
        img = PImage.open(path + image)
        loadedImages.append(img)

    return loadedImages

path = "/Users/Documents/Project/images/"

# your images in an array
imgs = loadImages(path)

# It's a bit messy here since I tried some options

def sample_async_batch_annotate_images(
    input_image_uri=imgs,  # os.path.abspath("/Users/Documents/Project/images"),
    output_uri=os.path.abspath("/Users/Documents/Project/images/output"),
):
    """Perform async batch image annotation."""
    client = vision_v1.ImageAnnotatorClient()

    source = {"image_uri": input_image_uri}
    image = {"source": source}
    features = [
        {"type_": vision_v1.Feature.Type.IMAGE_PROPERTIES},
    ]

    # Each requests element corresponds to a single image.  To annotate more
    # images, create a request element for each image and add it to
    # the array of requests
    requests = [{"image": image, "features": features}]
    gcs_destination = {"uri": output_uri}

    # The max number of responses to output in each JSON file
    batch_size = 6
    output_config = {"gcs_destination": gcs_destination,
                     "batch_size": batch_size}

    operation = client.async_batch_annotate_images(requests=requests, output_config=output_config)

    print("Waiting for operation to complete...")
    response = operation.result(90)

    # The output is written to GCS with the provided output_uri as prefix
    gcs_output_uri = response.output_config.gcs_destination.uri
    print("Output written to GCS with prefix: {}".format(gcs_output_uri))

async_batch_annotate_images() does not support reading local files. You need to use batch_annotate_images() instead. To read local images, read each image's content as bytes and pass it in the request. The code below saves the response as response.json to a GCS bucket. If you don't need to save it to GCS, uncomment the part that saves the file locally.

See code below:

import io
import os

from google.cloud import vision_v1

from os import listdir
import proto
from google.cloud import storage
#os.environ["GOOGLE_APPLICATION_CREDENTIALS"] = r"/Users/example/Documents/Project/ServiceAccountTolken.json"

def loadImages(path):
    # return array of bytes

    imagesList = listdir(path)
    loadedImages = []

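    for name in imagesList:
        # Read each file as raw bytes; os.path.join supplies the path separator
        with io.open(os.path.join(path, name), 'rb') as image_file:
            content = image_file.read()
        loadedImages.append(content)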

    return loadedImages

path = "/home/user/your_local_path"

# your images in an array
contents = loadImages(path)

def batch_annotate(
    contents=contents,
):
    """Perform batch image annotation on local image bytes."""
    client = vision_v1.ImageAnnotatorClient()

    bucket_name = "your-bucket-name"
    destination_blob_name = "response.json"

    storage_client = storage.Client()
    bucket = storage_client.bucket(bucket_name)
    blob = bucket.blob(destination_blob_name)

    requests = []

    for content in contents:
        image = {"content": content}
        features = [
            {"type_": vision_v1.Feature.Type.IMAGE_PROPERTIES},
        ]
        requests.append({"image": image, "features": features})

    response = client.batch_annotate_images(requests=requests,)
    to_text = proto.Message.to_json(response) # convert object to text

    # Uncomment if you want to save the response to your local directory
    # with open('response.json', 'w') as f:
    #     f.write(to_text)

    # Upload the JSON text to the bucket as response.json
    blob.upload_from_string(to_text, content_type="application/json")

See saved result in GCS: [screenshot not preserved]


Sample snippet of the response.json: [screenshot not preserved]
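
To get from the saved response.json to the image properties themselves, something like the following may help. It is only a sketch: it assumes the camelCase key names (responses, imagePropertiesAnnotation, dominantColors, colors, color, score) that the default JSON serialization of the response produces, so check them against your own file. The function name dominant_colors is made up for illustration.

```python
import json


def dominant_colors(response_json):
    """Return, for each image, a list of (red, green, blue, score) tuples."""
    data = json.loads(response_json)
    per_image = []
    for item in data.get("responses", []):
        # Assumption: camelCase keys as written by the default serialization.
        colors = (
            item.get("imagePropertiesAnnotation", {})
            .get("dominantColors", {})
            .get("colors", [])
        )
        per_image.append([
            (
                c.get("color", {}).get("red", 0),
                c.get("color", {}).get("green", 0),
                c.get("color", {}).get("blue", 0),
                c.get("score", 0),
            )
            for c in colors
        ])
    return per_image
```

For example, reading the file with open("response.json") and passing its text to dominant_colors() gives one list of colors per image, in the order the images were sent.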


