.. _camera: Camera ================== .. highlight:: python In this tutorial, you will learn the following: * Create a camera ``ICamera`` and mount it to an actor * Off-screen rendering for RGB, depth, point cloud and segmentation The full script can be downloaded here :download:`camera.py <../../../../examples/rendering/camera.py>` Create and mount a camera ------------------------------------------------------------ First of all, let's set up the engine, renderer, scene, lighting, and load a URDF file. .. literalinclude:: ../../../../examples/rendering/camera.py :dedent: 0 :lines: 17-36 We create a Vulkan-based renderer by calling ``sapien.VulkanRenderer(offscreen_only=...)``. If ``offscreen_only=True``, the on-screen display is disabled. It works without a window server like x-server. You can forget about all the difficulties working with x-server and OpenGL! Next, you can create a camera and mount it somewhere as follows: .. literalinclude:: ../../../../examples/rendering/camera.py :dedent: 0 :lines: 41-66 Camera should be mounted on an ``Actor``. If the mounted actor is kinematic (and static), then the camera is fixed. Otherwise, the camera moves along with the actor on which it mounts. .. note:: Note that the axes conventions for SAPIEN follow the conventions for robotics, while they are different from those for many graphics softwares (like OpenGL and Blender). For a SAPIEN camera, the x-axis points forward, the y-axis left, and the z-axis upwards. Render RGB image ------------------------------------------------------------ To render from a camera, you need to first update all object states to the renderer. Then, you should call ``take_picture()`` to actually render. .. literalinclude:: ../../../../examples/rendering/camera.py :dedent: 0 :lines: 68-70 Now, we can acquire the RGB image rendered by the camera. To save the image, we use `pillow `_ here, which can be installed by ``pip install pillow``. .. literalinclude:: ../../../../examples/rendering/camera.py :dedent: 0 :lines: 75-80 .. figure:: assets/color.png :width: 1080px :align: center Generate point cloud ------------------------------------------------------------ Point cloud is a common representation of 3D scenes. The following code showcases how to acquire the point cloud in SAPIEN. .. literalinclude:: ../../../../examples/rendering/camera.py :dedent: 0 :lines: 85-86 We acquire a "position" image with 4 channels. The first 3 channels represent the 3D position of each pixel in the OpenGL camera space, and the last channel is a flag indicating whether the position is beyond the camera frustrum far plane. .. literalinclude:: ../../../../examples/rendering/camera.py :dedent: 0 :lines: 88-94 Note that the position is represented in the OpenGL camera space, where the negative z-axis points forward and the y-axis is upward. Thus, to acquire a point cloud in the SAPIEN world space (x forward and z up), we provide ``get_model_matrix()``, which returns the transformation from the OpenGL camera space to the SAPIEN world space. We visualize the point cloud by `Open3D `_, which can be installed by ``pip install open3d``. .. figure:: assets/point_cloud.png :width: 1080px :align: center Besides, the depth map can be obtained as well. .. literalinclude:: ../../../../examples/rendering/camera.py :dedent: 0 :lines: 106-109 .. figure:: assets/depth.png :width: 1080px :align: center Visualize segmentation ------------------------------------------------------------ SAPIEN provides the interfaces to acquire object-level segmentation. .. literalinclude:: ../../../../examples/rendering/camera.py :dedent: 0 :lines: 116-128 There are two levels of segmentation. The first one is mesh-level, and the other one is actor-level. The examples are illustrated below. .. figure:: assets/label0.png :width: 1080px :align: center Mesh-level segmentation .. figure:: assets/label1.png :width: 1080px :align: center Actor-level segmentation Take a screenshot from the viewer ------------------------------------------------------------ The ``Window`` of the viewer also provides the same interfaces as `Camera`, ``get_float_texture`` and ``get_uint32_texture``. Thus, you could take a screenshot by calling them. Notice the definition of ``rpy`` (roll, yaw, pitch) when you set the viewer camera. .. literalinclude:: ../../../../examples/rendering/camera.py :dedent: 0 :lines: 133-154 .. figure:: assets/screenshot.png :width: 1080px :align: center