Real-Time Object Detection from a Camera with TensorFlow

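The official source code only demonstrates detection on still images, which is of limited practical use, so I modified it to run real-time detection using a laptop's built-in camera or a USB camera. The second image shows real-time object detection from the camera.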

Reference source

I:\Anaconda\StudyTensorflow\models\research>I:/Anaconda/librl/protoc-3.4.0-win32/bin/protoc object_detection/protos/*.proto --python_out=.
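On Windows, cmd.exe does not expand `*.proto` itself, so if the protoc build in use does not handle the wildcard, the file list can be expanded in Python before protoc is called. This is only a minimal sketch, assuming protoc is run from the models/research directory as in the command above; the helper name build_protoc_command is hypothetical and not part of the TensorFlow models repository.

```python
import glob
import os


def build_protoc_command(protoc_path, research_dir):
    """Build a protoc argument list with the .proto wildcard expanded.

    build_protoc_command is a hypothetical helper for illustration only.
    """
    pattern = os.path.join(research_dir, "object_detection", "protos", "*.proto")
    # Paths are made relative to research_dir so that, when protoc runs from
    # that directory, the generated *_pb2.py files land beside their sources.
    protos = sorted(
        os.path.relpath(path, research_dir).replace(os.sep, "/")
        for path in glob.glob(pattern)
    )
    if not protos:
        raise FileNotFoundError("no .proto files match " + pattern)
    return [protoc_path] + protos + ["--python_out=."]
```

The resulting list can be passed to `subprocess.run(command, cwd=research_dir, check=True)`.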

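Environment variable setup:

In the site-packages directory of the target environment (D:\ProgramData\Anaconda3\envs\tensorflow\Lib\site-packages), add a file named tensorflow_model.pth with the following contents: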

I:\Anaconda\StudyTensorflow\models\research\slim

I:\Anaconda\StudyTensorflow\models\research
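A .pth file in site-packages makes Python append each listed directory to sys.path at startup. The sketch below reproduces that mechanism with the standard site module in a temporary directory, so it can be tried without touching a real environment. The temporary folders are stand-ins for the two research paths above, and add_paths_via_pth is a hypothetical helper name.

```python
import os
import site
import sys
import tempfile


def add_paths_via_pth(site_dir, pth_name, directories):
    """Write a .pth file listing directories, then let `site` process it."""
    pth_path = os.path.join(site_dir, pth_name)
    with open(pth_path, "w") as handle:
        # One directory per line, the same layout as tensorflow_model.pth.
        handle.write("\n".join(directories) + "\n")
    # addsitedir() scans site_dir for *.pth files and appends every listed
    # directory that exists to sys.path.
    site.addsitedir(site_dir)
    return pth_path


# Stand-in directories (assumption: they play the role of the research and
# research\slim folders of the models checkout).
workspace = tempfile.mkdtemp()
site_dir = os.path.join(workspace, "site-packages")
research = os.path.join(workspace, "research")
slim = os.path.join(research, "slim")
os.makedirs(site_dir)
os.makedirs(slim)

pth_file = add_paths_via_pth(site_dir, "tensorflow_model.pth", [slim, research])
print(slim in sys.path, research in sys.path)
```

In a real environment no call to addsitedir is needed: the interpreter processes the .pth files in site-packages on its own at startup.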

Download address for the OpenCV build that matches Python 3.5

File name: opencv_python-3.4.1-cp35-cp35m-win_amd64.whl

Install: pip install opencv_python-3.4.1-cp35-cp35m-win_amd64.whl

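Replacements and changes for video surveillance detection:

Flipping the image upside down (flip code 0 is a vertical flip; a true 180-degree rotation needs flip code -1):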

image_np = cv2.flip(image_np, 0)
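cv2.flip takes a flip code: 0 flips around the x-axis (upside down), a positive value flips around the y-axis (mirror image), and a negative value does both, which is the same as rotating by 180 degrees. The NumPy sketch below expresses the three cases as array slicing, so the difference can be checked without a camera or OpenCV. The tiny 2x3 array is only an illustration.

```python
import numpy as np

# A 2x3 "image" whose values make the row and column order easy to follow.
frame = np.array([[1, 2, 3],
                  [4, 5, 6]])

vertical = frame[::-1, :]         # same result as cv2.flip(frame, 0)
horizontal = frame[:, ::-1]       # same result as cv2.flip(frame, 1)
rotated_180 = frame[::-1, ::-1]   # same result as cv2.flip(frame, -1)

print(vertical)
print(horizontal)
print(rotated_180)
```

If the camera is mounted upside down, the negative flip code is usually the one wanted, since a vertical flip alone also mirrors the scene left to right relative to a true rotation.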

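Add a check:

The first part is the old version, which runs fairly smoothly.

The second part is the new version, which stutters somewhat.

The code below can be placed in the target directory and run directly (this file is named object_detection_converted.py).

File path: ./models/research/object_detection/object_detection_converted.py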
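The capture loop itself is not reproduced in this excerpt. As a rough sketch of how such a script is commonly organized, the loop below reads frames, passes each one through a detection callback, and stops when the source runs out, the display callback asks to stop, or a frame limit is reached. The names run_detection_loop, detect and on_frame are hypothetical; the OpenCV calls inside main() (cv2.VideoCapture, cv2.imshow, cv2.waitKey) are the standard ones, and camera index 0 is assumed to be the built-in camera.

```python
def run_detection_loop(read_frame, detect, on_frame, max_frames=None):
    """Process frames until the source stops; return how many were handled.

    read_frame() returns (ok, frame) like cv2.VideoCapture.read().
    detect(frame) returns the annotated frame.
    on_frame(frame) displays it and returns False to stop the loop.
    All three callbacks are hypothetical placeholders.
    """
    count = 0
    while max_frames is None or count < max_frames:
        ok, frame = read_frame()
        if not ok:
            break
        count += 1
        if not on_frame(detect(frame)):
            break
    return count


def main():
    import cv2  # imported here so the loop above has no OpenCV dependency

    capture = cv2.VideoCapture(0)  # assumption: 0 is the built-in camera

    def show(frame):
        cv2.imshow("object detection", frame)
        # Keep running until the user presses q.
        return cv2.waitKey(1) & 0xFF != ord("q")

    try:
        # The identity lambda is a placeholder for the real detection step.
        run_detection_loop(capture.read, lambda frame: frame, show)
    finally:
        capture.release()
        cv2.destroyAllWindows()
```

Calling main() opens the camera window. Keeping the loop free of OpenCV calls means it can be exercised with a fake frame source, which helps when comparing a smooth variant against a stuttering one.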


# Licensed under the Apache License, Version 2.0 (the "License");

# you may not use this file except in compliance with the License.

# You may obtain a copy of the License at

# /licenses/LICENSE-2.0

# Unless required by applicable law or agreed to in writing, software

# distributed under the License is distributed on an "AS IS" BASIS,

# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.

# See the License for the specific language governing permissions and

# limitations under the License.

# ==============================================================================

"""A set of functions that are used for visualization.

These functions often receive an image, perform some visualization on the image.

The functions do not return a value, instead they modify the image itself.

"""

import collections

import functools

# Set headless-friendly backend.

# import matplotlib; matplotlib.use('Agg') # pylint: disable=multiple-statements

import matplotlib.pyplot as plt # pylint: disable=g-import-not-at-top

import numpy as np

import PIL.Image as Image

import PIL.ImageColor as ImageColor

import PIL.ImageDraw as ImageDraw

import PIL.ImageFont as ImageFont

import six

import tensorflow as tf

from object_detection.core import standard_fields as fields

_TITLE_LEFT_MARGIN = 10

_TITLE_TOP_MARGIN = 10

STANDARD_COLORS = [

'AliceBlue', 'Chartreuse', 'Aqua', 'Aquamarine', 'Azure', 'Beige', 'Bisque',

'BlanchedAlmond', 'BlueViolet', 'BurlyWood', 'CadetBlue', 'AntiqueWhite',

'Chocolate', 'Coral', 'CornflowerBlue', 'Cornsilk', 'Crimson', 'Cyan',

'DarkCyan', 'DarkGoldenRod', 'DarkGrey', 'DarkKhaki', 'DarkOrange',

'DarkOrchid', 'DarkSalmon', 'DarkSeaGreen', 'DarkTurquoise', 'DarkViolet',

'DeepPink', 'DeepSkyBlue', 'DodgerBlue', 'FireBrick', 'FloralWhite',

'ForestGreen', 'Fuchsia', 'Gainsboro', 'GhostWhite', 'Gold', 'GoldenRod',

'Salmon', 'Tan', 'HoneyDew', 'HotPink', 'IndianRed', 'Ivory', 'Khaki',

'Lavender', 'LavenderBlush', 'LawnGreen', 'LemonChiffon', 'LightBlue',

'LightCoral', 'LightCyan', 'LightGoldenRodYellow', 'LightGray', 'LightGrey',

'LightGreen', 'LightPink', 'LightSalmon', 'LightSeaGreen', 'LightSkyBlue',

'LightSlateGray', 'LightSlateGrey', 'LightSteelBlue', 'LightYellow', 'Lime',

'LimeGreen', 'Linen', 'Magenta', 'MediumAquaMarine', 'MediumOrchid',

'MediumPurple', 'MediumSeaGreen', 'MediumSlateBlue', 'MediumSpringGreen',

'MediumTurquoise', 'MediumVioletRed', 'MintCream', 'MistyRose', 'Moccasin',

'NavajoWhite', 'OldLace', 'Olive', 'OliveDrab', 'Orange', 'OrangeRed',

'Orchid', 'PaleGoldenRod', 'PaleGreen', 'PaleTurquoise', 'PaleVioletRed',

'PapayaWhip', 'PeachPuff', 'Peru', 'Pink', 'Plum', 'PowderBlue', 'Purple',

'Red', 'RosyBrown', 'RoyalBlue', 'SaddleBrown', 'Green', 'SandyBrown',

'SeaGreen', 'SeaShell', 'Sienna', 'Silver', 'SkyBlue', 'SlateBlue',

'SlateGray', 'SlateGrey', 'Snow', 'SpringGreen', 'SteelBlue', 'GreenYellow',

'Teal', 'Thistle', 'Tomato', 'Turquoise', 'Violet', 'Wheat', 'White',

'WhiteSmoke', 'Yellow', 'YellowGreen'

]

def save_image_array_as_png(image, output_path):
  """Saves an image (represented as a numpy array) to PNG.

  Args:
    image: a numpy array with shape [height, width, 3].
    output_path: path to which image should be written.
  """
  image_pil = Image.fromarray(np.uint8(image)).convert('RGB')
  with tf.gfile.Open(output_path, 'w') as fid:
    image_pil.save(fid, 'PNG')

def encode_image_array_as_png_str(image):
  """Encodes a numpy array into a PNG string.

  Args:
    image: a numpy array with shape [height, width, 3].

  Returns:
    PNG encoded image string.
  """
  image_pil = Image.fromarray(np.uint8(image))
  output = six.BytesIO()
  image_pil.save(output, format='PNG')
  # Read the encoded bytes back out of the in-memory buffer.
  png_string = output.getvalue()
  output.close()
  return png_string

def draw_bounding_box_on_image_array(image,
                                     ymin,
                                     xmin,
                                     ymax,
                                     xmax,
                                     color='red',
                                     thickness=4,
                                     display_str_list=(),
                                     use_normalized_coordinates=True):
  """Adds a bounding box to an image (numpy array).

  Bounding box coordinates can be specified in either absolute (pixel) or
  normalized coordinates by setting the use_normalized_coordinates argument.

  Args:
    image: a numpy array with shape [height, width, 3].
    ymin: ymin of bounding box.
    xmin: xmin of bounding box.
    ymax: ymax of bounding box.
    xmax: xmax of bounding box.
    color: color to draw bounding box. Default is red.
    thickness: line thickness. Default value is 4.
    display_str_list: list of strings to display in box
                      (each to be shown on its own line).
    use_normalized_coordinates: If True (default), treat coordinates
      ymin, xmin, ymax, xmax as relative to the image. Otherwise treat
      coordinates as absolute.
  """
  image_pil = Image.fromarray(np.uint8(image)).convert('RGB')
  draw_bounding_box_on_image(image_pil, ymin, xmin, ymax, xmax, color,
                             thickness, display_str_list,
                             use_normalized_coordinates)
  # Copy the drawn pixels back so the caller's array is modified in place.
  np.copyto(image, np.array(image_pil))

def draw_bounding_box_on_image(image,
                               ymin,
                               xmin,
                               ymax,
                               xmax,
                               color='red',
                               thickness=4,
                               display_str_list=(),
                               use_normalized_coordinates=True):
  """Adds a bounding box to an image.

  Bounding box coordinates can be specified in either absolute (pixel) or
  normalized coordinates by setting the use_normalized_coordinates argument.

  Each string in display_str_list is displayed on a separate line above the
  bounding box in black text on a rectangle filled with the input 'color'.
  If the top of the bounding box extends to the edge of the image, the strings
  are displayed below the bounding box.

  Args:
    image: a PIL.Image object.
    ymin: ymin of bounding box.
    xmin: xmin of bounding box.
    ymax: ymax of bounding box.
    xmax: xmax of bounding box.
    color: color to draw bounding box. Default is red.
    thickness: line thickness. Default value is 4.
    display_str_list: list of strings to display in box
                      (each to be shown on its own line).
    use_normalized_coordinates: If True (default), treat coordinates
      ymin, xmin, ymax, xmax as relative to the image. Otherwise treat
      coordinates as absolute.
  """

  draw = ImageDraw.Draw(image)
  im_width, im_height = image.size
  if use_normalized_coordinates:
    (left, right, top, bottom) = (xmin * im_width, xmax * im_width,
                                  ymin * im_height, ymax * im_height)
  else:
    (left, right, top, bottom) = (xmin, xmax, ymin, ymax)
  draw.line([(left, top), (left, bottom), (right, bottom),
             (right, top), (left, top)], width=thickness, fill=color)
  try:
    font = ImageFont.truetype('arial.ttf', 24)
  except IOError:
    font = ImageFont.load_default()

  # If the total height of the display strings added to the top of the bounding
  # box exceeds the top of the image, stack the strings below the bounding box
  # instead of above.
  display_str_heights = [font.getsize(ds)[1] for ds in display_str_list]
  # Each display_str has a top and bottom margin of 0.05x.
  total_display_str_height = (1 + 2 * 0.05) * sum(display_str_heights)

  if top > total_display_str_height:
    text_bottom = top
  else:
    text_bottom = bottom + total_display_str_height
  # Reverse list and print from bottom to top.
  for display_str in display_str_list[::-1]:
    text_width, text_height = font.getsize(display_str)
    margin = np.ceil(0.05 * text_height)
    draw.rectangle(
        [(left, text_bottom - text_height - 2 * margin), (left + text_width,
                                                          text_bottom)],
        fill=color)
    draw.text(
        (left + margin, text_bottom - text_height - margin),
        display_str,
        fill='black',
        font=font)
    text_bottom -= text_height - 2 * margin

def draw_bounding_boxes_on_image_array(image,
                                       boxes,
                                       color='red',
                                       thickness=4,
                                       display_str_list_list=()):
  """Draws bounding boxes on image (numpy array).

  Args:
    image: a numpy array object.
    boxes: a 2 dimensional numpy array of [N, 4]: (ymin, xmin, ymax, xmax).
           The coordinates are in normalized format between [0, 1].
    color: color to draw bounding box. Default is red.
    thickness: line thickness. Default value is 4.
    display_str_list_list: list of list of strings.
                           a list of strings for each bounding box.
                           The reason to pass a list of strings for a
                           bounding box is that it might contain
                           multiple labels.

  Raises:
    ValueError: if boxes is not a [N, 4] array
  """
  image_pil = Image.fromarray(image)
  draw_bounding_boxes_on_image(image_pil, boxes, color, thickness,
                               display_str_list_list)
  # Copy the drawn pixels back so the caller's array is modified in place.
  np.copyto(image, np.array(image_pil))
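The coordinate handling in draw_bounding_box_on_image can be checked in isolation. The sketch below repeats the two decisions that function makes: scaling normalized coordinates by the image size, and placing the label stack above the box unless there is not enough room. The function names to_pixel_box and label_bottom are hypothetical, and the numbers are made-up illustrations rather than values from a real detection.

```python
def to_pixel_box(ymin, xmin, ymax, xmax, im_width, im_height,
                 use_normalized_coordinates=True):
    """Return (left, right, top, bottom) in pixels."""
    if use_normalized_coordinates:
        # Normalized values in [0, 1] are scaled by the image dimensions.
        return (xmin * im_width, xmax * im_width,
                ymin * im_height, ymax * im_height)
    return (xmin, xmax, ymin, ymax)


def label_bottom(top, bottom, text_heights):
    """Return the y coordinate of the bottom edge of the label stack."""
    # Each label line gets a 5% margin above and below its text.
    total = (1 + 2 * 0.05) * sum(text_heights)
    # Enough room above the box: labels sit on top of it.
    # Otherwise they are stacked below the box instead.
    return top if top > total else bottom + total


# A box covering the right half and the middle rows of a 640x480 frame.
left, right, top, bottom = to_pixel_box(0.25, 0.5, 0.75, 1.0, 640, 480)
print(left, right, top, bottom)
print(label_bottom(top, bottom, [20]))
```

Note that the box order in the detection output is (ymin, xmin, ymax, xmax), while the drawing code works with (left, right, top, bottom); mixing the two is an easy mistake when adapting the script.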
