Click here to Skip to main content
15,881,204 members
Articles / Programming Languages / Markdown

Capturing Images from Camera using Python and DirectShow

Rate me:
Please Sign up or sign in to vote.
4.20/5 (4 votes)
13 Jan 2019CPOL6 min read 78.4K   1.6K   8   29
In this article, I propose a simple application written entirely in Python that allows you to capture images from a camera using DirectShow and to perform simple processing on them using OpenCV.

Introduction

Using OpenCV in Python is a very good solution to prototype vision applications, it allows you to quickly draft and test algorithms. It’s very easy to process images read from files, not so easy if you want to process images captured from a camera. OpenCV provides some basic methods to access the camera linked to the PC (through the object VideoCapture), but most of the time, they aren’t enough even for a simply prototype. For instance, it’s not possible to list all the cameras linked to the PC and there isn’t a quick way to tune the parameters of the camera. Alternatively, you can use PyGame or the SDK provided by the camera manufacturer, if available.

In Windows to interact with the cameras, DirectShow is often used. Its main strengths are:

  • Almost any camera provides a driver that allows it to be used from DirectShow.
  • It’s a technology well established and widely used.
  • It’s based on the COM framework, so it is designed to be used from different programming languages.

Conversely, it’s quite an old technology that is being replaced by the Windows Media Foundation and Microsoft is not developing it anymore. But it’s not a big deal because it has all the features needed and it’s used in so many applications that (in my opinion), Microsoft will keep it available for a long time.

Here, I want to propose a simple application written entirely in Python that allows you to capture images from a camera using DirectShow, display them on screen and perform simple processing on them using OpenCV. The application is based on a class that exposes some of the functionalities of DirectShow and that can be reused in other applications. The code is designed to be easy to change and to expand.

You can find the code of the application also on https://github.com/andreaschiavinato/python_grabber.

Background

To understand this article, you need basic knowledge of Python and of the Windows application development.

Using the Application

  1. Make sure the following Python packages are installed: numpy, matplotlib, opencv-python, comtypes.
  2. Run main.py.
  3. Once the application is started, the dialog Select a video device is shown. Select a camera, then press Ok.

Image 1

  1. A screen that allows you to select the camera resolution will be shown. Select the desired resolution or leave the default values and press Ok.
  2. The camera live stream is shown on the left part of the screen.
  3. Press Grab to capure a photo, the photo will be shown on the right part of the screen. You can apply some filters to the photo using the buttons on the right part of the screen or save the picture pressing Save.

Image 2

On Windows 10, if the size of text has been set to a value different from 100% (on the Display settings screen of Windows), the live image of the camera may show not well aligned on the containing frame. This seems an issue of DirectShow, since I have the same behaviour on other applications.

DirectShow in Short Words

DirectShow is a framework to write multimedia applications. The building block of a DirectShow application is a filter, an object that performs a base operation like:

  • Reading audio\video from a camera or a file
  • Converting an audio\video from a format to another one
  • Rendering audio\video to the screen or to a file

The filters can have input and output pins, and they can be connected together in order to perform the required job

DirectShow provides an object called Filter Graph that is responsible to collect the filters, link them and run them.

For our application, we need the following filters:

  • A camera source filter, that reads the images from the camera
  • A SampleGrabber filter that allows us to do some operation on each frame provided by the camera
  • A video render filter that displays the live stream of the camera on the GUI

Depending on the camera, we may need additional filters to convert the images provided by the camera in images that can be used by the SampleGrabber. These additional filters are added automatically by directshow.

If you want to learn more about DirectShow, it's worth reading the documentation provided by Microsoft.

Using the Class FilterGraph

You can use the class FilterGraph (contained in the file dshow_graph.py) as standalone. The class represent a DirectShow Filter Graph object.

Example 1

This code lists the cameras connected to your PC:

Python
from pygrabber.dshow_graph import FilterGraph

graph = FilterGraph()
print(graph.get_input_devices())

Example 2

This code shows a screen with the live image from the first camera in your PC.

We add to the graph two filters: one is a source filter corresponding to the first camera connected to your PC, the second is the default render, that shows the images from the camera in a window on the screen. Then we call prepare, that connects the two filters together, and run, to execute the graph.

Finally, we need a method to pause the program while watching the camera video. I use the Tkinter mainloop function which fetches and handles Windows events, so the application doesn’t seem frozen.

Python
from pygrabber.dshow_graph import FilterGraph
from tkinter import Tk

graph = FilterGraph()
graph.add_input_device(0)
graph.add_default_render()
graph.prepare()
graph.run()
root = Tk()
root.withdraw() # hide Tkinter main window
root.mainloop()

Example 3

The following code uses the sample grabber filter to capture single images from the camera. To capture an image, the method grab_frame is called. The image will be retrieved from the callback function passed as parameter to the add_sample_grabber method. In this case, the image captured is shown using the function imshow of opencv.

Python
from pygrabber.dshow_graph import FilterGraph
import cv2

graph = FilterGraph()
cv2.namedWindow('Image', cv2.WINDOW_NORMAL)
graph.add_input_device(0)
graph.add_sample_grabber(lambda image: cv2.imshow("Image", image))
graph.add_null_render()
graph.prepare()
graph.run()
print("Press 'C' or 'c' to grab photo, another key to exit")
while cv2.waitKey(0) in [ord('c'), ord('C')]:
    graph.grab_frame()
graph.stop()
cv2.destroyAllWindows()
print("Done")

Example 4

The following code captures an image from a camera in a synchronous way. An Event object is used to block the main thread until the image is ready.

The image is shown using matplotlib. Note that the function np.flip is used to invert the last dimension of the image, since the image returned by the sample grabber filter has the BGR format, but mathplotlib requires the RGB fromat. We didn't do this on the previous example since OpenCV accepts the format BGR.

Python
import threading
import matplotlib.pyplot as plt
import numpy as np
from pygrabber.dshow_graph import FilterGraph

image_done = threading.Event()
image_grabbed = None

def img_cb(image):
    global image_done
    global image_grabbed
    image_grabbed = np.flip(image, 2)
    image_done.set()

graph = FilterGraph()
graph.add_input_device(0)
graph.add_sample_grabber(img_cb)
graph.add_null_render()
graph.prepare()
graph.run()
input("Press ENTER to grab photo")
graph.grab_frame()
image_done.wait(1000)
graph.stop()
plt.imshow(image_grabbed)
plt.show()

Example 5

The following code is an improvement of Example 4 that allows you to capture images from two camera at the same time.

Python
import threading
import matplotlib.pyplot as plt
import numpy as np
from pygrabber.dshow_graph import FilterGraph

class Camera:
    def __init__(self, device_id):
        self.graph = FilterGraph()
        self.graph.add_input_device(device_id)
        self.graph.add_sample_grabber(self.img_cb)
        self.graph.add_null_render()
        self.graph.display_format_dialog()
        self.graph.prepare()
        self.graph.run()

        self.image_grabbed = None
        self.image_done = threading.Event()

    def img_cb(self, image):
        self.image_grabbed = np.flip(image, 2)
        self.image_done.set()

    def capture(self):
        self.graph.grab_frame()

    def wait_image(self):
        self.image_done.wait(1000)
        return self.image_grabbed        

print("Opening first camera")
camera1 = Camera(0)
print("Opening second camera")
camera2 = Camera(1)
input("Press ENTER to grab photos")
camera1.capture()
camera2.capture()
print("Waiting images")
image1 = camera1.wait_image()
image2 = camera2.wait_image()
print("Done")
ax1 = plt.subplot(2, 1, 1)
ax1.imshow(image1)
ax2 = plt.subplot(2, 1, 2)
ax2.imshow(image2)
plt.show()

Example 6

ScreenCaptureRecorder is a software that includes a direct show input filter that provides images taken from the current screen. We can use it with the FilterGraph class to create an app that captures screenshots.

You can install ScreenCaptureRecorder from https://github.com/rdp/screen-capture-recorder-to-video-windows-free/releases. In my case, the installer did not register the capture source filter, and I had to do this operation manually executing the following commands:

Regsvr32 C:\Program Files (x86)\Screen Capturer Recorder\screen-capture-recorder.dll
Regsvr32 C:\Program Files (x86)\Screen Capturer Recorder\screen-capture-recorder-x64.dll

Below is a variation of Example 4 that makes sure the screen-capture-recorder filter is used as input device, and that can be used to capture screenshots:

Python
import threading
import matplotlib.pyplot as plt
import numpy as np
from pygrabber.dshow_graph import FilterGraph

image_done = threading.Event()
image_grabbed = None

def img_cb(image):
    global image_done
    global image_grabbed
    image_grabbed = np.flip(image, 2)
    image_done.set()

graph = FilterGraph()
screen_recorder_id = next(device[0] for device in enumerate(graph.get_input_devices())
                          if device[1] == "screen-capture-recorder")
graph.add_input_device(screen_recorder_id)
graph.add_sample_grabber(img_cb)
graph.add_null_render()
graph.prepare()
graph.run()
input("Press ENTER to capture a screenshot")
graph.grab_frame()
image_done.wait(1000)
graph.stop()
plt.imshow(image_grabbed)
plt.show()

History

  • 14th January, 2019: Initial version

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


Written By
Software Developer
Unknown
This member has not yet provided a Biography. Assume it's interesting and varied, and probably something to do with programming.

Comments and Discussions

 
QuestionICameraControl? Pin
Member 1541723623-Nov-21 0:51
Member 1541723623-Nov-21 0:51 
BugSetProperties button press causes crash Pin
Member 1510536418-Mar-21 14:56
Member 1510536418-Mar-21 14:56 
GeneralRe: SetProperties button press causes crash Pin
Andreask842-Apr-21 9:06
Andreask842-Apr-21 9:06 
GeneralRe: SetProperties button press causes crash Pin
Member 1541723621-Nov-21 22:55
Member 1541723621-Nov-21 22:55 
QuestionMismatched scan lines Pin
virtual3d28-Sep-20 10:03
virtual3d28-Sep-20 10:03 
AnswerRe: Mismatched scan lines Pin
Andreask8429-Sep-20 19:51
Andreask8429-Sep-20 19:51 
QuestionProgram closes after showing image for moment after selecting video device Pin
John Biondo7-Jul-20 18:00
John Biondo7-Jul-20 18:00 
AnswerRe: Program closes after showing image for moment after selecting video device Pin
Andreask8410-Jul-20 10:40
Andreask8410-Jul-20 10:40 
Questionget_input_devices updates itself when camera device is physically disconnected during runtime? Pin
Member 148499191-Jun-20 2:43
Member 148499191-Jun-20 2:43 
AnswerRe: get_input_devices updates itself when camera device is physically disconnected during runtime? Pin
Andreask8423-Jun-20 3:35
Andreask8423-Jun-20 3:35 
QuestionHow to get the camera properties, also getting the white screen in the preview window Pin
tulasi dass25-Apr-20 8:16
tulasi dass25-Apr-20 8:16 
AnswerRe: How to get the camera properties, also getting the white screen in the preview window Pin
Andreask841-May-20 2:39
Andreask841-May-20 2:39 
GeneralRe: How to get the camera properties, also getting the white screen in the preview window Pin
tulasi dass1-May-20 7:19
tulasi dass1-May-20 7:19 
GeneralRe: How to get the camera properties, also getting the white screen in the preview window Pin
Andreask842-May-20 1:35
Andreask842-May-20 1:35 
GeneralRe: How to get the camera properties, also getting the white screen in the preview window Pin
Member 149051564-Aug-20 8:15
Member 149051564-Aug-20 8:15 
QuestionCamera Resolution increase? Pin
scazzuso31-Mar-20 3:53
scazzuso31-Mar-20 3:53 
AnswerRe: Camera Resolution increase? Pin
Andreask8431-Mar-20 22:16
Andreask8431-Mar-20 22:16 
Hi,

You can get the last version of the code from https://github.com/andreaschiavinato/python_grabber.
On the GUI, you can set the camera resolution using the command Set format.. from the camera menu.
You can set the resolution by code too, check this file: https://github.com/andreaschiavinato/python_grabber/blob/master/examples/setting_format.py
GeneralRe: Camera Resolution increase? Pin
scazzuso1-Apr-20 2:04
scazzuso1-Apr-20 2:04 
Generalerror in main.py and close program next to apply properties Pin
Member 1472071619-Jan-20 8:06
Member 1472071619-Jan-20 8:06 
GeneralRe: error in main.py and close program next to apply properties Pin
Andreask8415-Feb-20 3:08
Andreask8415-Feb-20 3:08 
Questionpassing live frames from the rendered to opencv Pin
marwanrassi12-Sep-19 23:50
marwanrassi12-Sep-19 23:50 
AnswerRe: passing live frames from the rendered to opencv Pin
OriginalGriff12-Sep-19 23:52
mveOriginalGriff12-Sep-19 23:52 
AnswerRe: passing live frames from the rendered to opencv Pin
Andreask8423-Oct-19 9:53
Andreask8423-Oct-19 9:53 
GeneralRe: passing live frames from the rendered to opencv Pin
Member 1495044728-Sep-20 2:20
Member 1495044728-Sep-20 2:20 
GeneralRe: passing live frames from the rendered to opencv Pin
Member 1484991928-Sep-20 5:13
Member 1484991928-Sep-20 5:13 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Praise Praise    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.