This video describes the fundamentals of digital fringe projection techniques, which provide dense 3D measurements of dynamically changing surfaces. It also demonstrates the design and operation of a high-speed binary defocusing system based on these techniques.
Digital fringe projection (DFP) techniques provide dense 3D measurements of dynamically changing surfaces. Like the human eyes and brain, DFP uses triangulation between corresponding points in two views of the same scene at different angles to compute depth. However, unlike a stereo-based method, DFP uses a digital video projector to replace one of the cameras1. The projector rapidly projects a known sinusoidal pattern onto the subject, and the subject's surface distorts these patterns in the camera's field of view. Three distorted patterns (fringe images) from the camera can be used to compute depth via triangulation.
Compared with other 3D measurement methods, DFP techniques lead to systems that tend to be faster, lower in equipment cost, more flexible, and easier to develop. DFP systems can also achieve the same measurement resolution as the camera. For these reasons, DFP and other digital structured light techniques have recently been the focus of intense research (as summarized in1-5). Taking advantage of DFP, the graphics processing unit, and optimized algorithms, we have developed a system capable of 30 Hz 3D video data acquisition, reconstruction, and visualization with more than 300,000 measurement points per frame6,7. Binary defocusing DFP methods can achieve even higher speeds8.
Numerous applications can benefit from DFP techniques. Our collaborators have used our systems for facial function analysis9, facial animation10, cardiac mechanics studies11, and fluid surface measurements, but many other potential applications exist. This video teaches the fundamentals of DFP techniques and illustrates the design and operation of a binary defocusing DFP system.
Digital fringe projection (DFP) techniques are based upon correlation and triangulation between two views of the same scene at different angles, the same principle employed by the human eyes and brain to achieve stereo vision. However, unlike a stereo-based method, DFP uses a digital video projector to replace one of the cameras1. The projector rapidly projects a known sinusoidal pattern onto the object, and the object’s surface distorts that pattern in the camera’s view. Three such distorted patterns (fringe images), phase-shifted relative to one another, can be analyzed to retrieve depth via triangulation. The use of a known pattern eliminates the difficult computational problem of identifying correspondence points, allowing the capture of depth measurements at the camera resolution. For example, with a 576 x 576 camera, the technique can capture 331,776 points per frame. This allows DFP systems to measure very fine details, such as the movement of facial muscles during emotional expressions.
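To make the phase analysis concrete: in the standard three-step phase-shifting algorithm, the three fringe images can be modeled as I_k(x, y) = I'(x, y) + I''(x, y) cos[φ(x, y) + δ_k] with phase shifts δ_k = -2π/3, 0, +2π/3, and the wrapped phase is recovered pixel by pixel as φ = arctan[√3 (I1 - I3) / (2 I2 - I1 - I3)]. The Python/NumPy sketch below illustrates this computation; the function names and the data-modulation mask are illustrative choices rather than part of the protocol, and the wrapped phase still has to be unwrapped and converted to depth through system calibration.

```python
import numpy as np

def wrapped_phase(i1, i2, i3):
    """Three-step phase shifting: recover the wrapped phase (radians) from three
    fringe images whose phase shifts are -2*pi/3, 0, and +2*pi/3."""
    i1, i2, i3 = (np.asarray(i, dtype=np.float64) for i in (i1, i2, i3))
    return np.arctan2(np.sqrt(3.0) * (i1 - i3), 2.0 * i2 - i1 - i3)

def data_modulation(i1, i2, i3):
    """Fringe (data) modulation in [0, 1]; low values mark pixels with weak fringe
    contrast (background, shadows) that should be excluded from reconstruction."""
    i1, i2, i3 = (np.asarray(i, dtype=np.float64) for i in (i1, i2, i3))
    num = np.sqrt(3.0 * (i1 - i3) ** 2 + (2.0 * i2 - i1 - i3) ** 2)
    den = i1 + i2 + i3
    return np.divide(num, den, out=np.zeros_like(num), where=den > 0)
```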
3D optical imaging techniques for static or quasi-static events have been extensively studied over the past few decades and have seen great success in video game design, animation, movies, music videos, virtual reality, telesurgery, and many engineering disciplines5. Though numerous 3D profilometry techniques exist, they can be classified into two categories: surface contact methods and surface noncontact methods. Both the coordinate measuring machine (CMM) and the atomic force microscope (AFM) require contact with the measured surface to obtain 3D profiles at high accuracy. This requirement places severe restrictions on the speed of contact methods, which cannot reach kHz measurement speeds with thousands of points per scan.
Surface noncontact techniques typically utilize optical triangulation methods (e.g. stereo vision, spacetime stereo, structured light). By actively projecting known patterns onto the objects, structured light techniques can be used to measure surfaces without strong local texture variations1. Fringe analysis is a special group of structured light techniques that uses sinusoidal structured patterns (also known as fringe patterns). Because these patterns have intensities that vary continuously from point to point in a known manner, they boost the structured light techniques from projector-pixel resolution to camera-pixel resolution12. In the recent past, fringe analysis techniques were instrumental in achieving high-resolution 3D imaging.
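As a concrete illustration of such patterns, the short sketch below generates three 8-bit vertical sinusoidal fringe images shifted by 2π/3, the kind of sequence a conventional DFP projector would display; the resolution and fringe pitch are arbitrary example values, not recommendations.

```python
import numpy as np

def sinusoidal_fringe(width=1024, height=768, period=18, phase_shift=0.0):
    """One 8-bit vertical sinusoidal fringe pattern; `period` is the fringe pitch
    in projector pixels and `phase_shift` is in radians."""
    x = np.arange(width)
    # Intensity varies continuously from 0 to 255 across each fringe period.
    row = 127.5 + 127.5 * np.cos(2.0 * np.pi * x / period + phase_shift)
    return np.tile(row, (height, 1)).astype(np.uint8)

# Three patterns shifted by 2*pi/3 for a three-step phase-shifting sequence.
patterns = [sinusoidal_fringe(phase_shift=k * 2.0 * np.pi / 3.0) for k in range(3)]
```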
The digital fringe projection (DFP) technique uses digital video projectors to generate sinusoidal fringe patterns. This technique has the merits of lower cost, higher speed, and simplicity of development, and it has been a very active research area within the past decade. Recent developments in DFP and similar digital structured light techniques are summarized in1-5. For high-speed applications, a digital-light-processing (DLP) projector is preferable because its digital micromirror device forms images by rapid binary switching. The speed and flexibility of this technique have allowed us to acquire 3D video at 40 Hz 13 and later at 60 Hz 6,7.
Nevertheless, a fundamental speed limit exists for the traditional DFP technique. A DLP projector can only swap 8-bit color images at its maximum refresh rate (typically 120 Hz). Since the traditional fringe patterns are 8-bit grayscale images, we can encode three of them into one color image as the red, green, and blue color channels. The projector will swap each channel (and therefore each fringe pattern) at three times the refresh rate (typically 360 Hz). However, since each 3D video frame requires three fringe patterns, the maximum rate of 3D video capture is still only the refresh rate (120 Hz)3,14. To break past this hardware limitation, we have invented a modified DFP technique that uses binary defocusing8. Instead of 8-bit grayscale fringe patterns, this technique uses computer-generated 1-bit binary structured patterns. These patterns are defocused using the projector lens to become pseudo-sinusoidal patterns for DFP. Because DLP projectors can display binary images orders-of-magnitude faster than 8-bit grayscale images, the binary defocusing technology permits tens of kilohertz 3D video imaging speed with the same resolution as the conventional DFP techniques15.
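The effect of binary defocusing can be previewed numerically: a 1-bit squared binary pattern passed through a low-pass filter (a rough stand-in for the defocused projector lens, modeled here as a Gaussian blur) becomes approximately sinusoidal. The following sketch only simulates the idea; the pattern size, fringe pitch, and blur amount are illustrative assumptions.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def squared_binary_pattern(width=1024, height=768, period=18, phase_shift=0.0):
    """1-bit 'squared binary' fringe: each period is half fully on, half fully off."""
    x = np.arange(width)
    row = (np.cos(2.0 * np.pi * x / period + phase_shift) >= 0).astype(np.float64)
    return np.tile(row, (height, 1))

# Defocusing the projector lens low-pass filters the square wave; a Gaussian blur is
# a crude model of that effect and yields a pseudo-sinusoidal fringe.
binary = squared_binary_pattern()
pseudo_sinusoid = gaussian_filter(binary, sigma=4.0)  # sigma loosely models the defocus amount
```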
The overall goal of the following protocol is to demonstrate the basic implementation and operation of a binary defocusing three-step phase-shifting DFP system. First, the protocol will cover the selection and integration of the necessary components. Then, it will discuss the simplest, most readily accessible method of calibration for the system; more complex calibration methods are available in the literature for specific applications16,17. The protocol will then focus on the procedure for 3D video capture with the system and the process for converting the fringe images into visualized 3D measurements. Finally, we will present some representative results from our real-time and high-speed systems.
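One simple calibration approach in this spirit (offered here only as an illustration, since the exact procedure is given in the protocol itself) is reference-plane based: the wrapped phase is unwrapped, a flat reference plane is measured once, and depth is approximated as proportional to the phase difference from that plane, with the proportionality constant obtained from an object of known height. A minimal sketch under those assumptions:

```python
import numpy as np

def unwrap_rows(wrapped):
    """Naive spatial phase unwrapping, row by row; adequate only for smooth,
    continuous surfaces without large depth discontinuities."""
    return np.unwrap(wrapped, axis=1)

def depth_from_phase(phase_object, phase_reference, k_calib):
    """Reference-plane approximation: depth ~ k_calib * (phase difference from a flat
    reference plane). k_calib (e.g. mm per radian) comes from a known-height target."""
    return k_calib * (phase_object - phase_reference)
```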
This high-resolution, real-time to superfast 3D video imaging technology is a platform technology that could potentially benefit numerous and diverse scientific fields ranging from biological science to engineering practice. Biomedical applications include precision measurements of facial movements and organ surfaces. Other applications include 3D automated quality control with detection of warped surface features; 3D enhanced videoconferencing; detailed digitization of facial features for movies and videogames; dense and rapid deformation measurements for the design and analysis of structures; and fluid surface characterization. Many biological and engineering applications (e.g. beating rabbit hearts, fluid shockwaves) require the superfast imaging rates of a binary defocusing system to correctly resolve features without aliasing artifacts.
Nevertheless, many challenges to the widespread adoption of this technology remain. Conventional DFP technology requires the projector to display 8-bit grayscale sinusoidal fringe patterns, so its speed is limited by the projector’s refresh rate (typically 120 Hz). This speed is sufficient for capturing relatively slow motion, such as facial expressions, but numerous applications require faster capture rates.
Binary defocusing technology has relaxed this speed limitation, and we have successfully created a superfast 3D video imaging system. However, this system has two drawbacks. First, it requires an expensive projector such as the DLP Discovery platform and a costly high-speed video camera such as the Vision Research Phantom v9.1. Second, because it generates sinusoidal patterns by defocusing squared binary patterns, the binary defocusing technique has difficulty producing sinusoidal fringes of the same quality as the traditional DFP technique, and it has a reduced depth measurement range (for further explanation, see23). Recent investigation indicates that dithered binary sinusoidal patterns can significantly alleviate the limitation on depth measurement range19. Future research will focus on overcoming the remaining issues while preserving the merits of binary defocusing.
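As an illustration of the dithering idea, the sketch below converts an ideal grayscale sinusoidal fringe (values scaled to [0, 1]) into a 1-bit pattern with standard Floyd-Steinberg error diffusion; after defocusing (low-pass filtering), such a pattern approximates the sinusoid better than a simple squared binary pattern. The specific dithering schemes studied in the cited work may differ from this textbook version.

```python
import numpy as np

def floyd_steinberg_dither(image):
    """Error-diffusion dithering of a grayscale image (values in [0, 1]) to 1-bit."""
    img = np.asarray(image, dtype=np.float64).copy()
    h, w = img.shape
    out = np.zeros_like(img)
    for y in range(h):
        for x in range(w):
            old = img[y, x]
            new = 1.0 if old >= 0.5 else 0.0
            out[y, x] = new
            err = old - new
            # Push the quantization error onto not-yet-visited neighbors.
            if x + 1 < w:
                img[y, x + 1] += err * 7 / 16
            if y + 1 < h:
                if x > 0:
                    img[y + 1, x - 1] += err * 3 / 16
                img[y + 1, x] += err * 5 / 16
                if x + 1 < w:
                    img[y + 1, x + 1] += err * 1 / 16
    return out
```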
Another challenge is compressing and storing the large amount of data generated by high-speed, high-resolution 3D video imaging systems. Uncompressed 3D videos are drastically larger than uncompressed 2D videos. For instance, for a 3D video recorded at 30 Hz for 1 min at a resolution of 640 x 480, the .OBJ file size could be over 50 GB, making it extremely difficult to store. Since little progress has been made in 3D video compression, we will continue to focus on this problem in the future.
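As a rough sanity check of that figure (assuming a plain ASCII .OBJ export with one vertex per camera pixel and two triangles per pixel quad; normals, texture coordinates, or higher numeric precision would only add to it):

```python
# Back-of-the-envelope estimate of uncompressed .OBJ size for 1 min of 3D video.
frames       = 30 * 60                    # 30 Hz for 1 minute
w, h         = 640, 480
vertices     = w * h                      # one 3D point per camera pixel
triangles    = 2 * (w - 1) * (h - 1)      # two triangles per quad of neighboring pixels
vertex_bytes = len("v -123.456789 -123.456789 -123.456789\n")  # ~38 bytes per vertex line
face_bytes   = len("f 123456789 123456789 123456789\n")        # ~32 bytes per face line
total_bytes  = frames * (vertices * vertex_bytes + triangles * face_bytes)
print(f"~{total_bytes / 1e9:.0f} GB of uncompressed .OBJ text")  # roughly 55-60 GB
```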
The authors have nothing to disclose.
This research was an accumulated effort that began more than 10 years ago when Dr. Zhang was a graduate student at Stony Brook University. The current and previous students in our team at Iowa State University have contributed tremendously toward advancing this technology to where it is today. This work was partially sponsored by the National Science Foundation under project number CMMI 1150711 and by the William and Virginia Binger Foundation.