Python脚本中TensorFlow GPU加速配置指南-Python教程-PHP中文网

Python脚本中TensorFlow GPU加速配置指南

心靈之曲

发布： 2025-12-02 13:38:02

原创

377人浏览过

Python脚本中TensorFlow GPU加速配置指南

简要说明如何在python脚本中启用tensorflow的gpu加速功能，即使`tf.test.is_gpu_available`返回true，也可能需要额外配置。文章将指导用户检查兼容性、配置gpu内存增长，并提供示例代码，以确保深度学习模型充分利用gpu性能，解决cpu运行缓慢的问题。

1. 引言：深度学习与GPU加速

在进行深度学习模型训练或推理时，图形处理器（GPU）凭借其强大的并行计算能力，能够显著提升处理速度，远超中央处理器（CPU）。然而，许多开发者会遇到一个常见问题：尽管系统已正确安装NVIDIA驱动、CUDA Toolkit和cuDNN，并且TensorFlow也能检测到GPU（tf.test.is_gpu_available()返回True），但实际运行的Python脚本仍旧在CPU上执行，导致性能瓶颈。这通常是由于TensorFlow的GPU配置未完全到位所致。本教程将详细介绍如何确保TensorFlow充分利用GPU资源。

2. GPU加速的前提条件与兼容性检查

在使用GPU加速之前，必须确保以下关键组件已正确安装并兼容：

NVIDIA驱动程序： 确保安装了与您的GPU型号兼容的最新NVIDIA驱动。
CUDA Toolkit： 这是NVIDIA提供的并行计算平台和编程模型，TensorFlow需要它来与GPU通信。
cuDNN： NVIDIA深度神经网络库，是CUDA的加速库，专门为深度学习任务优化。
TensorFlow版本兼容性： TensorFlow的不同版本对CUDA Toolkit和cuDNN的版本有严格的兼容性要求。务必查阅TensorFlow官方文档（例如TensorFlow安装指南），确认您安装的TensorFlow版本与CUDA/cuDNN版本匹配。版本不兼容是导致GPU无法使用的最常见原因之一。

需要注意的是，即使tf.test.is_gpu_available()返回True，也仅表示TensorFlow能够检测到系统中的GPU设备，并识别出CUDA环境。但这并不意味着您的所有模型操作都会自动在GPU上执行，尤其是在未进行显式配置的情况下。

3. 配置TensorFlow以利用GPU

为了确保TensorFlow模型能够实际运行在GPU上，我们需要在Python脚本中进行明确的配置，最关键的一步是设置GPU内存增长（Memory Growth）。

立即学习“Python免费学习笔记（深入）”；

默认情况下，TensorFlow可能会尝试在启动时一次性分配所有可用的GPU内存。这可能导致以下问题：

Otter.ai

一个自动的会议记录和笔记工具，会议内容生成和实时转录

查看详情

如果系统中有其他进程（如显示服务器、其他深度学习应用）也需要GPU内存，可能会导致冲突或内存不足。
在多GPU环境中，这可能导致资源分配不均。

通过启用内存增长，TensorFlow会根据需要动态分配GPU内存，而不是一次性占用全部，从而提高灵活性和与其他GPU进程的兼容性。

以下是在Python脚本中配置TensorFlow GPU的推荐方法：

import tensorflow as tf

# 确保在导入其他依赖（如Keras模型）之前执行此配置
# 检查物理GPU设备
physical_devices = tf.config.list_physical_devices('GPU')

if len(physical_devices) > 0:
    print(f"检测到 {len(physical_devices)} 个GPU设备。")
    try:
        # 遍历所有检测到的GPU设备
        for gpu in physical_devices:
            # 启用GPU内存增长
            tf.config.experimental.set_memory_growth(gpu, True)
            print(f"已为设备 {gpu} 启用内存增长。")
        print("TensorFlow已成功配置为使用GPU。")
    except RuntimeError as e:
        # 捕获并打印可能出现的运行时错误
        print(f"配置GPU时发生错误: {e}")
else:
    print("未检测到GPU设备。TensorFlow将运行在CPU上。")

# 此后，您可以加载Keras模型或执行其他TensorFlow操作
# 例如:
# from keras.models import load_model
# model = load_model('your_model.h5')
# ...

登录后复制

将配置代码集成到您的脚本中：

将上述GPU配置代码段放置在您的Python脚本中，紧随 import tensorflow 语句之后，且在任何涉及Keras模型加载或TensorFlow计算之前。

例如，对于原始问题中提供的脚本，您应该将其放置在 import tensorflow as tf 之后，但在加载Keras模型 model = load_model('model_1.h5') 之前。

import tkinter as tk
from tkinter import messagebox
from PIL import Image, ImageTk
import cv2
import numpy as np
import face_recognition
import os
import imutils
import time
from imutils.video import VideoStream
from keras.models import load_model
from keras.preprocessing import image
import tensorflow as tf # TensorFlow 导入在这里

# --- GPU 配置代码开始 ---
physical_devices = tf.config.list_physical_devices('GPU')
if len(physical_devices) > 0:
    print(f"检测到 {len(physical_devices)} 个GPU设备。")
    try:
        for gpu in physical_devices:
            tf.config.experimental.set_memory_growth(gpu, True)
            print(f"已为设备 {gpu} 启用内存增长。")
        print("TensorFlow已成功配置为使用GPU。")
    except RuntimeError as e:
        print(f"配置GPU时发生错误: {e}")
else:
    print("未检测到GPU设备。TensorFlow将运行在CPU上。")
# --- GPU 配置代码结束 ---

# Tkinter penceresini oluştur
root = tk.Tk()
root.title("Yüz Tanıma ve Duygu Analizi")

# Frame'leri oluştur
main_frame = tk.Frame(root)
main_frame.pack()

left_frame = tk.Frame(main_frame)
left_frame.pack(side=tk.LEFT)

right_frame = tk.Frame(main_frame)
right_frame.pack(side=tk.RIGHT)

# Kamera görüntüsü için etiket
video_label = tk.Label(left_frame)
video_label.pack()

# Yüz Tanıma ve Duygu Analizi etiketi
label = tk.Label(right_frame, text="Yüz Tanıma ve Duygu Analizi", font=("Helvetica", 16))
label.pack()

# Uygulamayı kapatacak buton
close_button = tk.Button(right_frame, text="Uygulamayı Kapat", command=root.quit)
close_button.pack()

# Eğitilmiş duygu analizi modelini yükle
# 此处加载模型，将在GPU配置后进行
model = load_model('model_1.h5')
label_dict = {0: 'Kizgin', 1: 'İgrenme', 2: 'Korku', 3: 'Mutlu', 4: 'Notr', 5: 'Uzgun', 6: 'Saskin'}

# Yüz tanıma için kullanılacak parametreler
DEFAULT_PROTOTXT = "deploy.prototxt.txt"
DEFAULT_MODEL = "res10_300x300_ssd_iter_140000.caffemodel"
DEFAULT_CONFIDENCE = 0.5

# Caffe modelini diskten yükle
net = cv2.dnn.readNetFromCaffe(DEFAULT_PROTOTXT, DEFAULT_MODEL)

# Video akışını başlat
vs = VideoStream(src=0).start()
time.sleep(2.0)

# Kamera görüntüsünü güncelle
def update_video():
    frame = vs.read()
    frame = imutils.resize(frame, width=900)
    (h, w) = frame.shape[:2]

    blob = cv2.dnn.blobFromImage(cv2.resize(frame, (300, 300)), 1.0, (300, 300), (104.0, 177.0, 123.0))
    net.setInput(blob)
    detections = net.forward()

    for i in range(0, detections.shape[2]):
        confidence = detections[0, 0, i, 2]

        if confidence < DEFAULT_CONFIDENCE:
            continue

        box = detections[0, 0, i, 3:7] * np.array([w, h, w, h])
        (startX, startY, endX, endY) = box.astype("int")

        face_roi = frame[startY:endY, startX:endX]
        face_gray = cv2.cvtColor(face_roi, cv2.COLOR_BGR2GRAY)
        face_img = cv2.resize(face_gray, (48, 48))

        face_img_array = image.img_to_array(face_img)
        face_img_array = np.expand_dims(face_img_array, axis=0)
        face_img_array = np.expand_dims(face_img_array, axis=-1)

        # 情感分析模型预测
        predictions = model.predict(face_img_array)
        emotion_label_index = np.argmax(predictions)
        predicted_emotion = label_dict[emotion_label_index]

        cv2.rectangle(frame, (startX, startY), (endX, endY), (0, 0, 255), 2)
        cv2.putText(frame, f'Duygu: {predicted_emotion}', (startX, startY - 10), cv2.FONT_HERSHEY_SIMPLEX, 0.9, (255, 255, 255), 2, cv2.LINE_AA)

    # Yüz eşleştirme için resimleri yükle
    image_folder = "image"  # Resimlerin bulunduğu klasör

    known_faces = []
    known_face_names = []

    for file in os.listdir(image_folder):
        if file.endswith(".jpg"):
            file_path = os.path.join(image_folder, file)
            img = face_recognition.load_image_file(file_path)
            encoding = face_recognition.face_encodings(img)[0]  # Her bir resmin yüz kodlamasını al
            known_faces.append(encoding)
            known_face_names.append(os.path.splitext(file)[0])  # Dosya adını yüz ismi olarak ekle

    # Yüz eşleştirme
    face_locations = face_recognition.face_locations(frame)
    unknown_face_encodings = face_recognition.face_encodings(frame, face_locations)

    for face_encoding in unknown_face_encodings:
        matches = face_recognition.compare_faces(known_faces, face_encoding)
        name = "Bilinmiyor"  # Eğer eşleşme yoksa

        if True in matches:
            first_match_index = matches.index(True)
            name = known_face_names[first_match_index]

        # Yüzün etrafına

登录后复制

以上就是Python脚本中TensorFlow GPU加速配置指南的详细内容，更多请关注php中文网其它相关文章！