The conference goal is to provide a unique forum for discussing how Artificial Intelligence could provide benefits to multi-modal image analysis and processing. Multi-modal imaging refers to systems able to acquire multiple 2D or 3D information about real scenes, with different sensing modality (ex: 3D point clouds, visible and infrared images, thermal images, hyperspectral sensing, and so on) and is used on a broad range of sensing-based applications. Artificial Intelligence, on the other hand, found recently a new renaissance, thanks to Machine Learning and Deep Learning paradigms successfully applied for addressing very challenging image interpretation tasks. In this context, researchers, developers and practitioners are encouraged to present the latest advance, highlighting how multi-modal sensing technologies and applications can benefit from using Artificial Intelligence based methodologies. The conference, with a specific emphasis on exploiting Artificial Intelligence methodologies, is focused on both: a) the metric performance of sensors and algorithms for producing the most accurate and reliable geometric measurements and models; and b) applications in different fields.

The conference targets topics related to multimodal imaging systems (calibration, performance, accuracy, etc.) and their application in various tasks such as object recognition, motion estimation, 3D reconstruction, autonomous mobile robot navigation, quality control, assembly in manufacturing, security, environment monitoring, medical imaging, holography, biomedical imaging.

Themes such as industrial inspection, material and component testing, virtual museums, motion analysis, mobile robot navigation, marketing and tourism, human body modeling, maritime sciences, medicine, aerospace, automotive, agrifood, security and the exploration of remote and hazardous sites, just to name a few, provide the contexts in which multi-modal sensing and AI methodologies can be synergically applied.
We invite submission of original research contributions, as well as demonstrations of successful applications in, but not limited to, the following technical areas

Multimodal Sensing: Technology
  • 3D passive sensors
  • 3D active sensors
  • hyperspectral imaging
  • light-field 3D sensing
  • full-field methods for inspection (holography, shearography, DIC)
  • thermography
  • interferometry .


  • Multimodal Sensing: Processing
  • calibration and measurements
  • pose estimation
  • image and range based modelling
  • 3D passive reconstruction
  • 3D active reconstruction
  • motion analysis
  • multiview analysis
  • real-time processing technology
  • expert system for detection and diagnosis of defects
  • embedded vision systems.


  • Multimodal Sensing: Applications
  • object recognition
  • scene interpretation
  • autonomous robot navigation
  • robotics
  • surveillance
  • environmental monitoring
  • surface quality control
  • industrial Inspection
  • face analysis
  • nondestructive testing methods
  • noninvasive inspection techniques
  • automation for material testing
  • development compact systems for in-situ inspection
  • monitoring of civil infrastructures (bridges, highways, buildings, Railways)
  • sensors for homeland-security
  • innovative systems for imaging and display systems
  • intelligent farm
  • intelligent factory
  • intelligent building
  • intelligent systems in health and medicine
  • bioinformatics.
  • ;
    In progress – view active session
    Conference 12621

    Multimodal Sensing and Artificial Intelligence: Technologies and Applications III

    27 - 29 June 2023 | ICM Room 12a
    View Session ∨
    • World of Photonics Plenary
    • 1: Multimodal Sensing and Inverse Modelling for Infrastructures and Environmental Monitoring
    • 2: Multimodal Sensing for Robotics
    • Optical Metrology Plenary Session
    • Posters-Wednesday
    • 3: Multimodal Sensing for Inspection
    • 4: Multimodal Sensing Applications I
    • 5: Multimodal Sensing for Health
    • 6: Full-field Imaging and Applications I
    • 7: Full-field Imaging and Applications II
    • 8: Multimodal Sensing Applications II
    World of Photonics Plenary
    27 June 2023 • 14:00 - 15:30 CEST | ICM, Saal 1
    This plenary session features a presentation by Tammy Ma, Lawrence Livermore National Lab. (United States), and Constantin Haefner, Fraunhofer-Institute for Laser Technology (Germany), on laser-driven inertial confinement fusion.
    Session 1: Multimodal Sensing and Inverse Modelling for Infrastructures and Environmental Monitoring
    27 June 2023 • 16:00 - 18:10 CEST | ICM Room 12a
    Session Chairs: Valerio Gagliardi, Univ. degli Studi di Roma Tre (Italy), Livia Lantini, Univ. of West London (United Kingdom)
    12621-1
    Author(s): Saeed Sotoudeh, Stephen Uzor, Livia Lantini, Univ. of West London (United Kingdom), The Faringdon Research Ctr. for Non-Destructive Testing and Remote Sensing (United Kingdom); Kevin Munisami, Univ. of West London (United Kingdom), Signal Processing, Electronics, Automation and Robotics (SPEAR) Research Group (United Kingdom); Fabio Tosti, Univ. of West London (United Kingdom), The Faringdon Research Ctr. for Non-Destructive Testing and Remote Sensing (United Kingdom)
    On demand | Presented live 27 June 2023
    12621-2
    Author(s): Valerio Gagliardi, Univ. degli Studi di Roma Tre (Italy); Antonio Napolitano, Sapienza Univ. di Roma (Italy); Fabrizio D'Amico, Alessandro Calvi, Andrea Benedetto, Univ. degli Studi di Roma Tre (Italy)
    On demand | Presented live 27 June 2023
    Show Abstract + Hide Abstract
    Transport assets are progressively more exposed to several issues, including climate change, vulnerability to natural hazards as subsidence, landslides and seismic phenomena, to mention a few. They can affect the structural integrity of infrastructures causing damages and deteriorations. In addition, the progressive ageing of assets and the degradation of materials, sometimes accelerated by the severe environmental conditions, are not contrasted by effective maintenance actions, because of the inappropriate and insufficient allocation of financial resources. Novel efficient and cost effective methods are needed for large-scale survey of infrastructures, and simultaneous detailed identification of local damages. They support the prioritization of the most effective countermeasures. We propose a method based on Data-Fusion approach, merging multi-source and multi-scale information acquired by multisensors systems. It enhances the interpretation of fused data under a holistic framework. For this purpose, processing methods of spaceborne data, such as Multi-temporal SAR Interferometry, are used to detect displacements of transport assets, with millimetre accuracy. This technique can be effectively complemented by more detailed data detected by UAVs and Ground Based Non-Destructive Testing Methods, including laser scanner surveys and by prospecting data collected by Ground Penetrating Radar. All these data are simultaneously fused and digitally integrated into a comprehensive and high resoluted Digital Twin, providing a useful tool to support operators and public authorities to identify and prioritize maintenance and rehabilitation actions. According to this holistic framework, the introduction of the Digital Twin, which replicates a real asset in a virtual or augmented reality environment, represents a relevant and strategic tool or facility to tackle effectively and efficiently the maintenance issues, enhancing resilience and operating life of the infrastructures. This study shows an experimental application of the proposed methodology for mapping wide areas affected by potential criticalities for the identification of the main vulnerabilities of the infrastructure. The time history of the vertical displacements of the infrastructure and their trends are generated through SAR data processing. The Digital Twin is implemented fusing spatial, aerial, ground-based and geophysical surveys and using also augmented reality tools, in order to have the best interface for visualizing the twin in the real environment. This allows the smart evaluation and monitoring of the infrastructure, this means effective, resoluted, efficient, holistic and up-to-date.
    12621-3
    Author(s): Luca Bianchini Ciampoli, Univ. degli Studi di Roma Tre (Italy); Roberta Santarelli, Sapienza Univ. di Roma (Italy); Pietro Meriggi, Jhon Romer Diezmos Manalo, Univ. degli Studi di Roma Tre (Italy); Alessandra Ten, Sapienza Univ. di Roma (Italy); Ersilia Maria Loreti, Sovrintendenza Capitolina ai Beni Culturali (Italy); Andrea Benedetto, Univ. degli Studi di Roma Tre (Italy)
    On demand | Presented live 27 June 2023
    Show Abstract + Hide Abstract
    Building Information Modeling (BIM) is a software-based parametric design approach that enables complete interoperability between all of the various actors engaged in a design or management process. Notwithstanding It was particularly developed for construction projects, it has since been applied to a variety of fields, including the design of transportation infrastructures and, more recently, cultural heritage. It has primarily been used in this area to improve the accuracy and efficiency of activities aimed at stabilizing and restoring historical structures. The purpose of the current study is to show how BIM is implemented by multisensors date and how the use of BIM can significantly raise the quality of virtual reconstructions and digital valorizations of historical structures, particularly when the rate of conservation is low. In spite of the fact that modern digital reconstruction models are frequently examined from an archaeological standpoint, their structural coherence has never been examined. This implies that many virtual models are likely to depict historically reliable structures that, due to their lack of structural logic, would not support their weight according to their geometric features and construction methods/materials. The research suggests a novel BIM-based methodology capable of both guiding and structurally testing archaeological reconstruction theories. The following procedure can serve as a broad representation of the model: 1- Multisensors survey of the emerging: gathering information from cursory historical investigations (topographic data, laser scanner, aero photogrammetry, satellite images) 2- Multisensors survey of the hidden: gathering information from ground-penetrating radar, electrical tomography, and magnetometry hypogeal scans; 3- Mechanical characterization: collecting data on the materials used in the discovery and confirming their mechanical properties through load stress tests; 4- Virtual reconstruction: the formulation of a potential hypothesis related to structural and morphological characteristics that are known to have existed during the referred historical eras; 5- Structural test: engineering and structural verification of the proposed theory using finite element algorithms and graphic statics techniques. In the framework of the Project BIMHERIT, funded by Regione Lazio, the proposed methodology was tested at a preliminary stage on the archaeological site of the Villa and Circus of Maxentius along the Ancient Appian Way in Rome. All planned activities have been shared with and authorized by the Sovrintendenza Capitolina ai Beni Culturali (DTC Lazio Call, Prot. 305-2020-35609).
    12621-4
    Author(s): Boualem Merainani, Thibaud Toullier, Jean Dumoulin, Univ. Gustave Eiffel (France)
    On demand | Presented live 27 June 2023
    Show Abstract + Hide Abstract
    Hot boxes, which refer to overheated rail-road car wheels and bearings, pose a significant threat to railway operations. Failure to detect and address hot boxes promptly can lead to catastrophic accidents such as derailments and fires. The main contribution of this paper lies with the development of an automatic hot box detection, tracking and counting method using the IR thermal cameras. The method combines the YOLO algorithm with the Kalman filter (KF) as a tracker. Original dataset was constructed from experiments conducted at different times of the day. The method points the way towards efficient and low-cost hot box detectors.
    12621-5
    Author(s): Giuseppe Esposito, Alvaro Y. Ruiz, Istituto per il Rilevamento Elettromagnetico dell'Ambiente (Italy); Gianluca Gennarelli, IREA-CNR (Italy); Giovanni Ludeno, Rosa Scapaticci, Roberta Palmeri, Lorenzo Crocco, Ilaria Catapano, Francesco Soldovieri, Istituto per il Rilevamento Elettromagnetico dell'Ambiente (Italy)
    On demand | Presented live 27 June 2023
    Show Abstract + Hide Abstract
    The timely detection of leakage in water mains is an issue that is relevant to the sustainable and efficient use of natural resources and the prevention of environmental hazards and risks for citizens. Consequently, the development of non-destructive techniques capable of detecting and localizing water leaks in buried pipelines is of huge interest. In this contribution, we present an artificial intelligence tool to perform automatic leakage detection from ground penetrating radar tomographic images. Ground penetrating radar is a prominent technology for subsoil inspection based on the remote interaction of microwave signals with buried anomalies, but its results require expert-users and are prone to subjective interpretation. This can be counteracted by processing raw-data using microwave tomography algorithms, which are capable of delivering more easily interpretable images. However, tomographic images can be still very difficult to interpret when the assumptions underlying the algorithm fail and therefore do not lead to conclusive results. To overcome this issue, we cast the leakage-detection problem as an image segmentation task, in which the popular convolutional neural network U-NET is trained to turn tomographic images obtained from raw-data processing into binary images clearly depicting the location of the leaks. Preliminary results with full-wave synthetic data confirm the potential of the proposed approach.
    12621-6
    Author(s): Saeed Parnow, Stephen Uzor, Livia Lantini, Fabio Tosti, Univ. of West London (United Kingdom), The Faringdon Research Ctr. for Non-Destructive Testing and Remote Sensing (United Kingdom)
    On demand | Presented live 27 June 2023
    Session 2: Multimodal Sensing for Robotics
    28 June 2023 • 08:20 - 10:00 CEST | ICM Room 12a
    Session Chairs: Vito Renò, Sistemi e Tecnologie Industriali Intelligenti per il Manifattuiero Avanzato (Italy), Vittorio Sala, SUPSI (Switzerland)
    12621-7
    Author(s): Pablo García-Gómez, BEAMAGINE S.L. (Spain); Eduardo Bernal, Ana Rodríguez-Aramendía, Noel Rodrigo, Univ. Politècnica de Catalunya (Spain); Jordi Riu, BEAMAGINE S.L. (Spain); Santiago Royo, Univ. Politècnica de Catalunya (Spain)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    AAutomated systems increase their requirements in all fields, tightening their performance requirements in aspects like reliability, and ease of manipulation. Within this communication, we will present the development of a compact perception unit which includes a 3D lidar, RGB and thermal imaging for advanced perception purposes. The proposed unit intends to solve the usual hardware problems that software developers intend to solve in field applications. The basic features and performance of the system will be presented, and the applicability of the multimodal sensing approach presented to different applications in security, autonomous vehicles, and other application areas will be overviewed with examples.
    12621-13
    Author(s): Adriano Liso, Angelo Cardellicchio, Cosimo Patruno, Massimiliano Nitti, Ettore Stella, Vito Renò, Sistemi e Tecnologie Industriali Intelligenti per il Manifattuiero Avanzato, Consiglio Nazionale delle Ricerche (Italy)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    Detecting defects that may arise in weldings using non-destructive tests is an area of active research. However, non-destructive tests require a significant effort from domain experts, whose evaluations may also be affected by subjectivity. To address these challenges, this work proposes an end-to-end framework for anomaly detection in linear aluminum welding, feeding its three-dimensional representation to an autoencoder trained to associate a small reconstruction error on non-defective samples, while providing large errors when fed with defective ones. Preliminary results show the effectiveness and feasibility of the proposed method, which can achieve an accuracy of over 80%.
    12621-9
    Author(s): Dun-Ren Liu, Chia-Wei Hsu, Chun-Ting Sung, Ting-Chien Chen, Shean-Jen Chen, National Yang Ming Chiao Tung Univ. (Taiwan)
    On demand | Presented live 28 June 2023
    12621-10
    Author(s): Vittorio Sala, Ambra Vandone, Stefano Baraldo, Michele Banfi, Angelo Baj, Francesco Impaziente, Federico Mazzucato, Anna Valente, SUPSI (Switzerland)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    Direct Energy Deposition (DED) is an additive manufacturing technology that allows for both the realization of complex shape objects and the repairing of damaged or worn metallic components. A high-power laser beam spans the substrate surface while metallic powder is added by nozzles, creating overlapping solid tracks and thus building up parts in a layer-wise fashion. Despite fine parameters optimization, the process instability may lead to layer height variations that could accumulate over time affecting the stand-off distance between the printing head and the substrate and impairing the printed part geometry. In the present work, an innovative process monitoring approach has been investigated. Sensors such as a pyrometer and RGB camera have been integrated into a DED machine, to measure the radiation emitted by the melted material at different spectral bandwidths. A neural network approach has been derived to estimate the stand-off distance and highlight possible anomalous situations. The training dataset has been acquired by printing tracks in several conditions. The proposed solution has been validated by comparing the geometry estimated on defective multilayer parts with the measurements provided by a high-resolution 3D scanner. Preliminary results show that the proposed method is a promising approach to prevent critical stand-off deviations.
    12621-11
    Author(s): Ting-Chien Chen, Yu-Cheng Sung, Chai-Wei Hsu, Dun-Ren Liu, Shean-Jen Chen, National Yang Ming Chiao Tung Univ. (Taiwan)
    On demand | Presented live 28 June 2023
    Break
    Coffee Break 10:00 - 10:30
    Optical Metrology Plenary Session
    28 June 2023 • 10:30 - 11:25 CEST | ICM, Saal 1
    10:30 to 10:40 hrs
    Welcome Address and Plenary Speaker Introduction

    Marc P. Georges, Liège Univ. (Belgium)
    Jörg Seewig, Technische Univ. Kaiserslautern (Germany)
    2023 Symposium Chairs
    PC12622-500
    Remote photonic medicine (Plenary Presentation)
    Author(s): Zeev Zalevsky, Bar-Ilan Univ. (Israel)
    28 June 2023 • 10:40 - 11:25 CEST | ICM, Saal 1
    Show Abstract + Hide Abstract
    I will present a photonic sensor that can be used for remote sensing of many biomedical parameters simultaneously and continuously. The technology is based upon illuminating a surface with a laser and then using an imaging camera to perform temporal and spatial tracking of secondary speckle patterns in order to have nano metric accurate estimation of the movement of the back reflecting surface. The capability of sensing those movements in nano-metric precision allows connecting the movement with remote bio-sensing and with medical diagnosis capabilities. The proposed technology was already applied for remote and continuous estimation of vital bio-signs (such as heart beats, respiration, blood pulse pressure and intra ocular pressure), for molecular sensing of chemicals in the blood stream (such as for estimation of alcohol, glucose and lactate concentrations in blood stream, blood coagulation and oximetry) as well as for sensing of hemodynamic characteristics such as blood flow related to brain activity. The sensor can be used for early diagnosis of diseases such as otitis, melanoma and breast cancer and lately it was tested in large scale clinical trials and provided highly efficient medical diagnosis capabilities for cardiopulmonary diseases. The capability of the sensor was also tested and verified in providing remote high-quality characterization of brain activity.
    Break
    Lunch Break 11:30 - 12:30
    Posters-Wednesday
    28 June 2023 • 12:30 - 13:30 CEST | ICM, Hall B0
    Poster authors, please set up posters between the morning coffee break and the end of lunch break on Wednesday. Plan to stand by your poster to discuss it with session attendees during the poster session. Remove your poster following the poster session conclusion as posters left on the boards will be discarded.
    12621-34
    Author(s): Evgeny A. Semenishchev, Moscow State Univ of Technology "STANKIN" (Russian Federation); Inessa Gracheva, Nikita Mityuov, Egor Surkov, Tula State University (Russian Federation); Viacheslav Voronin, Moscow State Univ of Technology (Russian Federation); Daniil Laykhov, Tula State University (Russian Federation); Aleksandr Zelensky, Moscow State Univ of Technology (Russian Federation)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    The modern development of microelectronics and the transition to get new technologies have made it form to minimize video data recording devices. Using mobile or small fixation complexes has found its application in many areas. The most popular technology has become a technology that allows you to expand the possibilities of human perception using other electromagnetic ranges. The most popular and used for practical tasks are the ranges of X-ray radiation (dangerous for humans) and infrared radiation (safe for humans at low powers). The near IR spectrum allows you to see in the dark or foggy conditions, the far IR spectrum allows you to form thermal portraits of objects. The imperfection and presence of inhomogeneities in the sensors adds a noise component to the generated data. Most often, to eliminate noise, the averaging procedure over a series of frames is used, but the relative low performance does not allow this procedure to be performed in real time. The problem of primary data processing for thermal imaging cameras is still relevant. To eliminate the noise component in infrared images, the paper proposes a method of inter-frame data alignment, built on the operations of pre-processing and highlighting the contours of objects and based on the following basic stapes: at the initial stage, the system operation parameters are determined, the frame size is read, the color field depth and speed fixing frames; the operation of selecting local areas with minimization of objects allows you to create a mask of objects and localize subsequent methods; determining the construction of inter-frame pixel shift vectors; accumulation\formation of color intensity change data; data filtering based on a multi-criteria method based on the application of the criterion of the standard deviation of the results and the generated estimates, as well as the standard deviations of the final differences of the second order; frame processing of images by a two-dimensional multi-criteria method with preservation of object boundaries; framing the output image. On the set of test data obtained by the seak and flir cameras, examples of processing objects of simple shapes are given. Examples of visual reduction of the noise component on frames of test images are shown. Examples of error reduction in the formation of the proposed approach on synthetic sequences with superimposed Gaussian noise are given.
    12621-42
    Author(s): Frederik Kammel, Lars Rathmann, Univ. of Freiburg (Germany); Annette Schmitt, Alexander Reiterer, Fraunhofer-Institut für Physikalische Messtechnik IPM (Germany), Univ. of Freiburg (Germany)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    For both economic and research purposes, accurate information on forest health and composition is of paramount importance. However, using conventional methods of surveying can be very time consuming for such an inventory. We present an innovative approach for forest inventory combining UAV-borne LiDAR, multispectral aerial imagery and terrestrial point cloud data measured with a handheld laser scanner. The result of this procedure are radiometrically enhanced 3D point clouds containing detailed information about the forest structure above and underneath its canopy. This provides a versatile basis for further processing such as tree species recognition, e.g., using convolutional neural networks.
    12621-43
    Author(s): Rosa Pia Devanna, Sistemi e Tecnologie Industriali Intelligenti per il Manifattuiero Avanzato, Consiglio Nazionale delle Ricerche (Italy); Giovanni Matranga, Istituto di Scienze e Tecnologie per l'Energia e la Mobilità Sostenibili, Consiglio Nazionale delle Ricerche (Italy); Marcella Biddoccu, Istituto di Scienze e Tecnologie per l'Energia e la Mobilità Sostenibili, Consiglio Nazionale delle (Italy); Giulio Reina, Politecnico di Bari (Italy); Annalisa Milella, Sistemi e Tecnologie Industriali Intelligenti per il Manifattuiero Avanzato, Consiglio Nazionale delle Ricerche (Italy)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    This study investigates the potential of a consumer-grade infrared stereo camera, i.e. the Intel RealSense D435, to automatically extract crop status information, such as Normalized Difference Vegetation Index (NDVI), in arable and permanent crops. The sensing device includes two infrared (IR) sensors for depth calculation and one colour sensor, which provide, for each point of the scene, both IR and visible light information thus making it possible pixel per pixel NDVI estimations. Measurements were performed on various arable crops including corn (Zea mays) and barley (Ordeum vulgare) and on two vine varieties, Freisa and Malvasia, and were compared to measurements taken by a Trimble GreenSeeker handheld crop sensor. Results show that although the RealSense camera tends to underestimate NDVI values compared to the GreenSeeker, there is a good correlation between the two sensing devices with squared correlation coefficient r2 = 0.68 and root-mean-square error RMSE = 0.07. The fitted regression equation was successively applied to correct new camera observations and compared with the GreenSeeker output, showing good agreement with RMSE of 0.07 and mean squared error (MSE) of 0.01. The use of the RGB-D camera to simultaneously provide NDVI and canopy height measurements by a farmer robot is also demonstrated in a Malvasia vineyard field, showing that the proposed system can be effectively adopted for fully automated plant-scale monitoring of high-value crops.
    12621-44
    Author(s): Kevin Blümel, Michael Kuhl, Finn Alfred Hölzel, Karl Kunze, Hochschule Mittweida (Germany)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    A microcontroller-based multi-sensor system using ultrasonic range sensors and radar sensors for sensor data fusion is presented. Using this system as a warning system can detect and prevent forklift accidents in the industry: The system uses the combined sensor data to detect objects’ distance, angle, and speed and calculate object trajectories to determine the risk of each moving object in the detection area. Because the dimensions of the microcontroller-based warning system are very small, it can be integrated into a warning vest for a factory worker. If the system detects and categorizes objects as risk, the factory worker is noticed by vibrotactile feedback about the location of the risk object. The paper deals with the introduction of the system followed by accuracy study and performance test.
    12621-45
    Author(s): Viacheslav Voronin, Marina Zhdanova, Moscow State Univ of Technology (Russian Federation); Evgeny A. Semenishchev, Moscow State Univ of Technology "STANKIN" (Russian Federation); Alexander Zelensky, Nikolay Gapon, Yurii Ilyukhin, Moscow State Univ of Technology (Russian Federation)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    Automation of production processes using robots is a priority for developing many industrial enterprises. Human-machine interaction is a key component of such control infrastructure. The proposed algorithm is a four-stage procedure: (a) fusion information from multimodal sensors based on the quaternion model, (b) image preprocessing using a 3D Gabor filter, (c) a descriptor calculation using 3D local binary dense micro-block difference with skeleton points, and (d) classification. The proposed algorithm is based on capturing 3D sub-volumes located inside a video sequence patch and calculating the difference in intensities between these sub-volumes; for intensified motion, used the convolution with a bank of 3D arbitrarily oriented Gabor filters and calculating 3D local binary dense micro-block difference. A program was developed for transmitting information about the target points of the robot's movement through a virtual TCP / IP port and a script for working out the target points in the simulation environment. To test the effectiveness of the proposed algorithm, we simulate an action recognition system for Human-robot collaboration in the RoboGuide environment.
    Session 3: Multimodal Sensing for Inspection
    28 June 2023 • 13:30 - 15:30 CEST | ICM Room 12a
    Session Chair: Rufei Zou, Beijing Univ. of Technology (China)
    12621-12
    Author(s): Yu Han, David Salido-Monzú, Andreas Wieser, Institut für Geodäsie und Photogrammetrie, ETH Zurich (Switzerland)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    Multispectral LiDAR is an emerging active remote sensing technique that combines distance and spectroscopy measurements on light reflected from the surface at the respective measurement point. It is known that the reflectance spectrum can be used for material classification. However, the spectrum also depends on other surface parameters, particularly surface roughness. Herein, we propose an extension of multispectral to polarimetric multispectral LiDAR and introduce polarized and unpolarized reflectance spectra as additional features for classifying materials and roughness. We demonstrate the feasibility and the benefit using a bench-top prototype instrument which allows acquiring standard, polarized and unpolarized reflectance spectra, in addition to distance, in 33 spectral channels with 10 nm bandwidth between 580 and 900 nm. We analyze and interpret the raw spectra obtained from measurements on test specimens consisting of five different materials (PE, PVC, PP, sandstone, limestone) with two different levels of surface roughness. Using a linear support vector machine (SVM) we demonstrate the potential of the different features for independent material and roughness classification. The results indicate that the unpolarized reflectance spectrum increases the material classification accuracy by 50% as compared to a standard spectrum, and that the polarized spectrum actually allows classifying roughness. We interpret the results as a strong indication that multispectral polarimetric LiDAR enables deriving practically relevant additional information on surfaces with high spatial resolution through remote sensing.
    12621-8
    Author(s): Simone Pio Negri, Giuseppe Lastilla, Maria di Summa, Sistemi e Tecnologie Industriali Intelligenti per il Manifattuiero Avanzato, Consiglio Nazionale del (Italy); Massimiliano Nitti, Istituto di Studi sui Sistemi Intelligenti per l'Automazione (Italy); Carmelo Antonio Ardito, Univ. LUM Giuseppe Degennaro (Italy); Vito Renò, Sistemi e Tecnologie Industriali Intelligenti per il Manifattuiero Avanzato, Consiglio Nazionale del (Italy)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    Carbon Fiber Reinforced Plastic (CFRP) is an important material in manufacturing, in particular in automotive and aerospace industry. Its relevance is due to its lightness, resistance and rigidity. In particular the lightness/resistance ratio is the property that will give to CFRP a favourable future in more and more applications, especially where the lightness and resistance are essential. On the other end CFRP is expensive and difficult to produce. During the production process of CFRP’s artefacts some defects may occur (e.g. material inclusions or bubbles), making the component not usable. When applicable, CFRP repair process is structured in: defect removal (scarfing), cleaning and patch application. Actually, scarfing is a slow process manually performed by human operators, strongly dependent on the workers’ skills, that generates a lot of toxic nano dusts. The present paper deals with the implementation of a (semi)automatic robotic cell to repair large CFRP aeronautic components. The robotic cell is composed by a collaborative robot equipped by a custom scarfing tool due to the context-specific constraints to be implemented. Under these constraints, the machining got is not as perfect as in a traditional milling process, thus requiring additional quality control and inspection after scarfing takes place. The paper describes the designed scarfing tool, examines the trajectories generated to execute the scarfing and illustrates some preliminary results in terms of accuracy of the shape got by the designed tool.
    12621-14
    Author(s): Cosimo Patruno, Massimiliano Nitti, Angelo Cardellicchio, Nicola Mosca, Maria di Summa, Vito Renò, Consiglio Nazionale delle Ricerche (Italy)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    This paper presents a preliminary study for evaluating the quality of welds in thermomagnetic switches using 3D sensing and machine learning techniques. A 3D sensor based on laser triangulation is used to gather the point cloud of the component. The point cloud is then processed to extract hand-crafted signatures for binary classification: defective or non-defective component. Features such as Gaussian and mean curvatures, density, and quadric surface properties, are used for building these significant signatures. Different machine learning models, including decision trees, Support Vector Machines, k-nearest neighbors, random forests, ensemble classifiers, and Artificial Neural Networks, are trained using the built signatures to classify the weld as defective or non-defective. Preliminary results on actual data achieve high classification accuracy (>80%) on all the tested models.
    12621-15
    Author(s): Angelo Cardellicchio, Consiglio Nazionale delle Ricerche (Italy); Sergio Ruggieri, Andrea Nettis, Politecnico di Bari (Italy); Nicola Mosca, Consiglio Nazionale delle Ricerche (Italy); Giuseppina Uva, Politecnico di Bari (Italy); Vito Renò, Consiglio Nazionale delle Ricerche (Italy)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    Monitoring and maintaining existing bridge stocks is a critical task. To reduce the required effort, automatic techniques for object detection were proposed. However, many of the existing algorithms either deliver poor accuracy (i.e., template-matching algorithms) or are computationally expensive (i.e., two-stages detectors). To overcome these issues, this work proposes a single-stage damage detector based on YOLOv5. To demonstrate the effectiveness of the proposed approach, a database composed by 6835 images from several bridges with seven classes of defects has been gathered. The overall results show a mean average precision above 60%, with high precision and recall.
    12621-16
    Author(s): Jean-Charles Baritaux, Tommy Dedole, CEA-LETI (France); Silvère Gousset, Etienne Le Coarer, Univ. Grenoble Alpes (France)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    Raman hyperspectral imaging allows the rapid screening of surfaces potentially contaminated by toxic chemicals. We present an application of multivariate models to the detection of target chemical signatures in Fourier-Transform Raman hyperspectral imaging. Working directly on the interferograms (Fourier domain) our method was developed to address two main challenges. First, the presence of possibly strong substrate signal overlapping with the target signal. Second, the variation in measured interferograms across the field of view requiring the use of locally trained models. Detection results obtained on a set of target compounds deposited on polymer substrates are presented.
    12621-17
    Author(s): Jannis Gangelhoff, Christoph S. Werner, Fraunhofer-Institut für Physikalische Messtechnik IPM (Germany); Alexander Reiterer, Albert-Ludwigs-Univ. Freiburg (Germany)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    Mapping of small and shallow water bodies is still primarily done manually. The Fraunhofer IPM developed and evaluated a compact airborne laser-scanning system which weighs under 4 kg and targets topographic and bathymetric surveys. Our sensor simultaneously emits green and infrared light. Combined with full waveform signal analysis, this allows for precise separation of water surface, seabed, and shoreline. A downward-facing palmer scanner deflects the beam in a near-elliptical shape. The system has been tested and evaluated at a quarry lake in turbid water conditions. The result is a high-resolution ground model demonstrating the benefits of a compact bathymetric scanner.
    Break
    Coffee Break 15:30 - 16:00
    Session 4: Multimodal Sensing Applications I
    28 June 2023 • 16:00 - 17:50 CEST | ICM Room 12a
    Session Chairs: Francesco Carlo Morabito, Univ. Mediterranea di Reggio Calabria (Italy), Andrei G. Anisimov, Technische Univ. Delft (Netherlands)
    12621-18
    Author(s): Sumesh Nair, Chai-Wei Hsu, National Yang Ming Chiao Tung Univ. (Taiwan); Yvonne Yuling Hu, National Cheng Kung Univ. (Taiwan); Shean-Jen Chen, National Yang Ming Chiao Tung Univ. (Taiwan)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    Caterpillars have been causing havoc to agriculture due to feeding aggressively on the foliage of the crops. The current methods of pest control like sticky traps or pheromone traps work on adult moths, but not on foraging caterpillars. Chemical means like pesticides are effective, but chemical residues on crops are concerning. Therefore, this study aims to primarily track and estimate the 3D position of the caterpillars in orchards in real-time. To this end, we have employed the state-of-the-art object detector YOLOv7, combined with Simple Online and Realtime Tracking with a Deep Association Metric (DeepSORT) algorithm. This combined approach, when compared to merely YOLOv7, has improved detection up to 25%, courtesy of the DeepSORT embedded tracker. The RGB-D camera is utilized for this work is Intel Realsense D405. For the training data, 2,000 images captured in a jujube orchard with varying exposure, occlusion, and wind conditions were used. Inference was done from completely new images in real time. In the experiments, the YOLOv7+DeepSORT approach makes detections within 14 ms per frame, with an average detection rate of 85%, indicative of is real-time applicability in orchards. The smallest object (around 2-cm length caterpillar) is recognized around 21×12 pixels, which is at a distance of 35 cm from the camera. Thus, this development of YOLOv7+DeepSORT approach can be integrated with technologies like laser robot arms, that can pick the caterpillars, or even into stand-off techniques like laser pest targeting, which can help eradicate the pest problems in a physical manner efficiently.
    12621-19
    Author(s): Eliana Cinotti, Istituto per il Rilevamento Elettromagnetico dell'Ambiente (Italy), Univ. degli Studi di Napoli Federico II (Italy); Giuseppe Esposito, Gianluca Gennarelli, Giovanni Ludeno, Ilaria Catapano, Istituto per il Rilevamento Elettromagnetico dell'Ambiente (Italy); Amedeo Capozzoli, Claudio Curcio, Angelo Liseno, Univ. degli Studi di Napoli Federico II (Italy); Francesco Soldovieri, Istituto per il Rilevamento Elettromagnetico dell'Ambiente (Italy)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    This study addresses a 2D scalar electromagnetic inverse source problem by using a deep neural network-based artificial intelligence technique. Specifically, the Learned Singular Value Decomposition (L-SVD) approach based on hybrid autoencoding is adopted. The main goal is to reproduce the singular value decomposition through neural networks and compare the reconstruction performance of L-SVD and truncated SVD (TSVD) in the case of noiseless data, which represents a reference benchmark. The results demonstrate that L-SVD outperforms TSVD in terms of spatial resolution.
    12621-20
    Author(s): Johannes Gürtler, TU Dresden (Germany); Sami Tasmany, Technische Univ. Graz (Austria); Robert Kuschmierz, TU Dresden (Germany); Jakob Woisetschläger, Technische Univ. Graz (Austria); Jürgen Czarske, TU Dresden (Germany)
    28 June 2023 • 16:50 - 17:10 CEST | ICM Room 12a
    Show Abstract + Hide Abstract
    Multimodal sensing is presented based on the measurement of density oscillations in swirl-stabilized flames, where density is coupled to physical quantities like heat release rate or sound pressure and advection velocity. The measurement approach is a high-speed camera based interferometer combined with a multi-camera background oriented Schlieren system, both enabling line of sight detection and therefore require solution of the inverse problem by tomographic reconstruction. Those reconstructions are realized using a neural network, resulting in the three-dimensional distribution of local thermoacoustic oscillations. Finally, the advection velocity of those oscillating vortex structures is calculated by image correlation.
    12621-21
    Author(s): Pabitro Ray, David Salido-Monzú, Andreas Wieser, Tomislav Medic, ETH Zurich (Switzerland)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    We demonstrate a supercontinuum-based hyperspectral laser scanning technique that provides high-precision distance measurements of natural surfaces along with their reflectance signature over the broad spectral range of the supercontinuum (SC) output. The SC used in our experiments is spectrally broadened to 570-970 nm from a 780 nm mode-locked femtosecond laser. Distance measurements are carried out by monitoring the differential phase delay of the intermode beat notes obtained from direct photodetection of the SC, while the backscattered reflection spectrum is acquired using a commercial spectrometer. We achieve a single-point range precision below 10 μm on natural targets (gypsum board and leaves of a plant used herein) placed at a stand-off distance of 5 m. Our results demonstrate the acquisition of hyperspectral point clouds together with sub-mm range noise on the scanned surface. This range performance is comparable to commercial state-of-the-art terrestrial laser scanners which traditionally employ a monochromatic laser source. We show the benefit of enhanced range precision toward correctly estimating the surface orientation and for radiometric calibration of the acquired intensities. Initial results illustrate the direct 3D mapping of spectral data of plant leaves with a reduced angle of incidence-related bias, highlighting new opportunities for future research into remote sensing of vegetation.
    12621-22
    Author(s): Maria Ballesta-Garcia, Sara Peña-Gutiérrez, Univ. Politècnica de Catalunya (Spain); Ana Rodríguez-Aramendía, Univ. Politècnica de Catalunya (Spain), Beamagine SL (Spain); Pablo García-Gómez, Beamagine SL (Spain); Noel Rodrigo, Santiago Royo, Univ. Politècnica de Catalunya (Spain)
    On demand | Presented live 28 June 2023
    Show Abstract + Hide Abstract
    The interest in LiDAR imaging systems has recently increased in outdoor ground-based applications related to computer vision. However, for the complete settling of the technology, there are still obstacles pending to be solved, being its use in adverse weather conditions one of the most challenging. Under the presence of bad weather, such as fog, the system performance is heavily altered and the quality of the detection becomes severely degraded. We propose to take advantage of the temporal digitization capabilities of a LiDAR setup combined with the use of polarimetric information, to try to improve the detection. We designed and constructed a LiDAR imaging system with outstanding characteristics related to optoelectronic modifications, in particular including the digitization of light pulses and the addition of circularly polarizing optical elements for polarimetric discrimination. To analyse the performance of the system, it was tested in a macro-scale fog chamber, and the acquired signal was analysed and characterized using cross-polarized detection (cross-channel) and co-polarized detection (co-channel). From the data obtained, we could identify and study different relevant phenomena which were only exposed through temporal digitization and which might enable optimized signal processing features. The backscattering signal of light that first interacts with the media is usually responsible for the saturation of the sensor but it may be easily filtered out using cross-channel. False-positive points that appear due to the scattering properties of the media are also reduced in cross-channel. We also demonstrated that cross-channel data shows increased range and post-processing of the point cloud, including detection and segmentation of objects embedded in the fog, is easier. So, we conclude that a system using a cross-configuration polarized light for detection could help to decrease the influence of backscattering from turbid media without compromising the signal returning from the objects of interest in most cases.
    Session 5: Multimodal Sensing for Health
    29 June 2023 • 08:40 - 10:00 CEST | ICM Room 12a
    Session Chairs: Maria di Summa, Sistemi e Tecnologie Industriali Intelligenti per il Manifattuiero Avanzato (Italy), Pietro Ferraro, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello" (Italy)
    12621-24
    Author(s): Wang Liao, Chen Zhang, Xinyu Sun, Technische Univ. Ilmenau (Germany); Gunther Notni, Technische Univ. Ilmenau (Germany), Fraunhofer-Institut für Angewandte Optik und Feinmechanik IOF (Germany)
    On demand | Presented live 29 June 2023
    Show Abstract + Hide Abstract
    Non-contact methods can expand the application scenarios of blood oxygen measurement with better hygiene and comfort, but the traditional non-contact methods are usually less accurate. In this study a novel non-contact approach for measuring peripheral oxygen saturation (SpO2) using deep learning and near-infrared multispectral videos is proposed. After a series of data processing including shading correction, global detrending and spectral channel normalization to reduce the influences from illumination non-uniformity, ambient light, and skin tone, the preprocessed video data are split into half-second clips (30 frames) as input of the 3D convolutional residual network. In the experiment, multispectral videos in 25 channels of hand palms from 7 participants were captured. The experimental results show that the proposed approach can accurately estimate SpO2 from near-infrared multispectral videos, which demonstrates the agreement with commercial pulse oximeter. The study also evaluated the performance of the approach with different combinations of near-infrared channels.
    12621-25
    Author(s): Michal Gontarz, Vibekananda Dutta, Malgorzata Kujawinska, Warsaw Univ. of Technology (Poland)
    On demand | Presented live 29 June 2023
    Show Abstract + Hide Abstract
    This talk presents a deep learning approach to phase unwrapping, which is a troublesome step in the sinogram creation in holographic tomography reconstruction. Through implementation of Attention U-Nets and Residual Blocks, phase is successfully denoised and unwrapped. Experiments show that the proposed solution is capable of unwrapping irregular, noisy and complex phase images with equal or, at times, superior performance. The solution has also been implemented into the holographic tomography reconstruction, which enables evaluation on the refractive index distribution of real specimen, against conventional State-of-the-Art solutions.
    12621-26
    Author(s): Daniele Pirone, Daniele G. Sirico, Lisa Miccio, Vittorio Bianco, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello", Consiglio Nazionale delle Ricerche (Italy); Martina Mugnano, Univ. degli Studi di Napoli Federico II (Italy); Pietro Ferraro, Pasquale Memmolo, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello", Consiglio Nazionale delle Ricerche (Italy)
    On demand | Presented live 29 June 2023
    Show Abstract + Hide Abstract
    In the recent years, Holographic Imaging Flow Cytometry has been demonstrated as a promising label-free and quantitative technique for the high-throughput analysis at the single-cell level. By recording hundreds of digital holograms per cell while it is flowing and rotating along a microfluidic channel, the 3D tomogram of its refractive indices can be reconstructed from the corresponding phase-contrast maps. Here we show that a fully convolutional end-to-end neural network can speed up the phase retrieval process, thus reducing the computational time for the tomographic reconstruction from tens of minutes to few seconds, while providing high fidelity and small memory footprint.
    12621-27
    Author(s): Maria di Summa, Nicola Mosca, Luciana Colella, STIIMA-CNR (Italy); Vittorio Bianco, Pietro Ferraro, ISASI-CNR (Italy); Ettore Stella, STIIMA-CNR (Italy)
    On demand | Presented live 29 June 2023
    Break
    Coffee Break 10:00 - 10:30
    Session 6: Full-field Imaging and Applications I
    29 June 2023 • 10:30 - 12:00 CEST | ICM Room 12a
    Session Chairs: Vittorio Bianco, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello" (Italy), Yanmin Zhu, The Univ. of Hong Kong (Hong Kong, China)
    12621-28
    Author(s): Fabiana Graziano, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello", Consiglio Nazionale delle (Italy), Univ. degli Studi della Campania Luigi Vanvitelli (Italy); Ciro Tortora, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello", Consiglio Nazionale delle (Italy), Univ. degli Studi di Napoli Federico II (Italy); Veronica Vespini, Massimo Rippa, Simonetta Grilli, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello", Consiglio Nazionale delle (Italy); Roberto Marani, Sistemi e Tecnologie Industriali Intelligenti per il Manifattuiero Avanzato, Consiglio Nazionale del (Italy); Sara Coppola, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello", Consiglio Nazionale delle (Italy); Ettore Stella, Sistemi e Tecnologie Industriali Intelligenti per il Manifattuiero Avanzato, Consiglio Nazionale del (Italy); Pietro Ferraro, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello", Consiglio Nazionale delle (Italy)
    On demand | Presented live 29 June 2023
    Show Abstract + Hide Abstract
    TIn recent years, composite materials have found numerous applications due to their mechanical characteristics and properties. The present work is focused on the characterization of composite materials using nondestructive techniques (NDT). Shearography and t thermography are used as nondestructive methods. The former, is an optical interferometric method for the detection of surface or sub-surface defects, the latter is a diagnostic technique for determining surface temperature and to understand the health status of the investigated object. Their use offers advantages related to visualization and testing of end products, noncontact nature, nondestructive and areal operating principle, rapid response, high sensitivity, resolution and accuracy
    12621-29
    Author(s): Yunhui Gao, Feng Yang, Liangcai Cao, Tsinghua Univ. (China)
    29 June 2023 • 10:50 - 11:20 CEST | ICM Room 12a
    Show Abstract + Hide Abstract
    Lensless microscopy is an emerging imaging modality that overcomes the inherent limitation of conventional lens-based optics, especially in terms of imaging throughput, functionality, and cost-effectiveness. Pixel super-resolution phase retrieval serves as the key underlying technique for reconstructing high-resolution holographic images from the raw measurements. In this talk, we revisit lensless microscopy from a computational imaging perspective. A unified mathematical framework is established and the encoding and decoding mechanisms of the phase and subpixel information are analyzed. Regularization and Nesterov’s momentum techniques are introduced to speed up the data acquisition and reconstruction procedures, respectively. The proposed algorithms are verified through a proof-of-concept lensless on-chip microscope. We experimentally demonstrate the capability of pixel super-resolution phase retrieval techniques in revealing the subpixel and quantitative phase information of complex biological samples.
    12621-30
    Author(s): Marika Valentino, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello", Consiglio Nazionale delle Ricerche (Italy), Univ. degli Studi di Napoli Federico II (Italy); Jaromír Běhal, Daniele Pirone, Pasquale Memmolo, Lisa Miccio, Vittorio Bianco, Pietro Ferraro, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello", Consiglio Nazionale delle Ricerche (Italy)
    On demand | Presented live 29 June 2023
    Show Abstract + Hide Abstract
    Microplastics (MPs) are abundant and not degradable pollutants of marine waters. Here we show how Digital Holography (DH) coupled to machine learning can be used to identify and characterize MPs in water samples. We show that features extracted from the wrapped phase-contrast map are key elements for effective identification of microplastic items of various shapes and sizes. We apply the fractal geometry framework to phase-contrast images to discern between several classes of microplastics and microalgae. We also show that a polarization-resolved holographic flow cytometer can use plastic birefringence properties to identify different synthetic and natural microfibers with material specificity.
    12621-31
    Author(s): Jan Rothhardt, Helmholtz Institute Jena (Germany); Wilhelm Eschen, Cheng Liu, Daniel S. Penagos M., Friedrich-Schiller-Univ. Jena (Germany), Helmholtz-Institute Jena (Germany); Robert Klas, Jens Limpert, Friedrich-Schiller-Univ. Jena (Germany), Helmholtz-Institute Jena (Germany), Fraunhofer Institute for Applied Optics and Precision Engineering (Germany)
    On demand | Presented live 29 June 2023
    Show Abstract + Hide Abstract
    I will present recent results obtained by table-top EUV ptychography microscopy based on a compact high-harmonic generation source providing a high coherent power at 13.5 nm wavelength. Sub-20 nm resolution is demonstrated by utilizing a combination of structured illumination and an advanced iterative phase retrieval engine. We achieve nanoscale and material-specific EUV imaging and apply this technique to a variety of samples from life and material sciences. EUV ptychography combines high resolution, strong material contrast, reasonable penetration depth and easy sample preparation. It thus has the potential to bridges the gap between visible-light and electron microscopy in many application fields.
    Break
    Lunch Break 12:00 - 13:10
    Session 7: Full-field Imaging and Applications II
    29 June 2023 • 13:10 - 14:20 CEST | ICM Room 12a
    Session Chairs: Vittorio Bianco, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello" (Italy), Yanmin Zhu, The Univ. of Hong Kong (Hong Kong, China)
    12621-32
    Author(s): Vittorio Bianco, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello", Consiglio Nazionale delle (Italy); Jaromír Běhal, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello", Consiglio Nazionale delle (Italy), Univ. degli Studi di Napoli Federico II (Italy); Pier Luigi Mazzeo, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello", Consiglio Nazionale delle (Italy); Marika Valentino, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello", Consiglio Nazionale delle (Italy), Univ. degli Studi di Napoli Federico II (Italy); Paolo Spagnolo, Lisa Miccio, Cosimo Distante, Pietro Ferraro, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello", Consiglio Nazionale delle (Italy)
    On demand | Presented live 29 June 2023
    Show Abstract + Hide Abstract
    Achieving clear vision through smoke and flames is a highly pursued goal to better manage intervention priorities and to allow first responders operating safely during fire accidents. Here we show different classes of active far-infrared systems to image static/moving targets through fire with different imaging performance and field-portability characteristics. Low-coherence infrared systems and high-coherence holographic sensors will be discussed. We show that a pre-trained convolutional neural network can detect the presence of a person hidden behind fire in real-time, accurately, even when the system is not able to reject the flame contributions in full, being suitable for video-surveillance applications.
    12621-33
    Author(s): Yanmin Zhu, Yuxing Li, Jianqing Huang, Yunping Zhang, Edmund Y. Lam, The University of Hong Kong (Hong Kong, China)
    On demand | Presented live 29 June 2023
    Show Abstract + Hide Abstract
    The assessment of microplastics (MPs) pollution and water quality monitoring raise a lot of attention in recent years. Discriminative methods are highly needed for quick and accurate in situ MP detections. Digital holography records the wavefront information of the objects and contains the morphology, refractive index, and roughness information. Polarization imaging inspects the optical anisotropy of MPs, which is related to their birefringence and material characteristics. In this work, we explore the capability of holographic and polarization imaging for the identification of MPs. The computed features, such as the angle of polarization (AoP) and degree of linear polarization (DoLP), show distinguishable characteristics of MPs. We inspect the method feasibility on MP classification as well as biological and natural particles. The proposed method shows potential use in real-time, non-contact in situ MPs detection and water pollution monitoring.
    12621-46
    Author(s): Omar De Mitri, Andreas Frommknecht, Marco F. Huber, Felix Müller-Graf, Fraunhofer-Institut für Produktionstechnik und Automatisierung IPA (Germany); Cosimo Distante, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello" (Italy)
    On demand | Presented live 29 June 2023
    Show Abstract + Hide Abstract
    Weld quality inspection allows the detection of defects that may compromise the quality and strength of the weld. Although visual optical inspection offers lower reliability than other non-destructive methods, it enables weld analysis at a significantly lower cost. In this context, developing machine learning-based algorithms for automatic optical weld quality recognition requires acquiring large amounts of data for training. This entails high costs in terms of time, material and energy required for test preparation. However, one possible approach to tackling the problem with limited datasets is to use synthetic data. Using such data increases the amount and variety of data available to the detection algorithm. With a focus on the context of welding, this paper presents an approach that uses synthetic data as a form of data augmentation to improve the performance of the optical detection of weld seams. Specifically, we propose a generative neural network for semantic image synthesis using a limited starting dataset. The network generates new data instances by receiving as input a semantic map of the image to be represented. Weld defects such as porosity or weld spatter are added to the semantic map so that the network synthesizes corresponding defect images. Analysing the performance on a segmentation network, experimental results show how adding synthetic data to the original data can ensure improvements in network performance.
    Session 8: Multimodal Sensing Applications II
    29 June 2023 • 14:20 - 17:30 CEST | ICM Room 12a
    Session Chair: Francesco Soldovieri, Istituto per il Rilevamento Elettromagnetico dell'Ambiente (Italy)
    12621-35
    Author(s): Francesco C. Morabito, Univ. Mediterranea di Reggio Calabria (Italy)
    29 June 2023 • 14:20 - 14:50 CEST | ICM Room 12a
    Show Abstract + Hide Abstract
    AI methodologies are strongly impacting on many fields, including computer vision and imaging. However, various limitations have been considered in practical contexts, e.g., paucity of data in space applications, high energy consumption for training, poor generalization in real scenarios, etc. In this work, we will focus on recently proposed techniques of meta-learning to face some of these drawbacks. The concept of multimodal imaging will be analyzed for improving performance and explanations of ML/DL models.
    12621-36
    Author(s): Zheng Liu, Qichao Wang, Rico Nestler, Gunther Notni, Technische Univ. Ilmenau (Germany)
    On demand | Presented live 29 June 2023
    Show Abstract + Hide Abstract
    In electronics manufacturing, the inspection of defects of electrical components on printed circuit boards (SMD-PCB) is an import part of the production chain. This process is normally implemented by automatic optical inspection (AOI) systems based on classical computer vision and multimodal imaging. Despite the highly developed image processing, misclassifications can occur due to the different, variable appearance of objects and defects and constantly emerging defect types, which can only be avoided by constant manual supervision and adaption. Therefore, a lot of manpower is needed to do this or to perform a subjective follow-up. In this paper, we present a new method using the principle of multimodal deep learning-based one-class novelty-detection to support AOIs and operators to detect defects more accurate or to determine whether something needs to be changed. By combining with a given AOI classification a powerful adaptive AOI system can be realized. To evaluate the performance of the multimodal novelty-detector, we conducted experiments with SMD-PCB-components imaged in texture and geometric modalities. Based on the idea of one-class-detection only normal data is needed to form training sets. Annotated defect data which is normally only insufficiently available, is only used in the tests. We report about some experiments in accordance with the consistence of data categories to investigate the applicability of this approach in different scenarios. Hereby we compared different state-of-the-art one-class novelty detection techniques using image data of different modalities. Besides the influence of different data fusion methods are discussed to find a good way to use this data and to show the benefits using multimodal data. Our experiments show an outstanding performance of defect detection using multimodal data based on our approach. Our best value of the widely known AUROC reaches more than 0.99 with real test data.
    12621-37
    Author(s): Laura Romeo, Rosa Pia Devanna, Roberto Marani, Giovanni Matranga, Marcella Biddoccu, Annalisa Milella, Consiglio Nazionale delle Ricerche (Italy)
    On demand | Presented live 29 June 2023
    Show Abstract + Hide Abstract
    In-field sensing systems for automatic yield monitoring are gaining increasing importance as they promise to give a considerable boost in production. The development of artificial intelligence and sensing technologies to assist the human workforce also meets sustainability needs, which impact the ecological goals of current and future agricultural processes. In this context, image acquisition and processing systems are widely adopted to extract useful information for farmers. Although RGB-D cameras have been used in many applications for ground-based proximal sensing, relatively few works can be found that include depth information in image analysis. In this work, both semantic and depth information from RGB-D vineyard images is used in processing pipeline composed of a decision tree algorithm and a deep learning model. The goal is to reach coherent semantic segmentation of a set of natural images acquired at both long and short distances, using a low-cost RGB-D camera in an experimental vineyard. Depth information of each image is fed into a decision tree to predict the distance of the acquired vines from the camera. Before feeding the deep learning models, the images to be segmented are manipulated according to the predicted distance. The results of semantic segmentation with and without using the decision tree are compared, showing how depth information appears to be highly relevant in enhancing the accuracy and precision of the predicted semantic maps.
    Coffee Break 15:30 - 16:00
    12621-38
    Author(s): Rufei Zou, Qini Ge, Beijing TaiGeek Technology Co., Ltd. (China); Yuexiang Peng, Beijing Univ. of Technology (China)
    On demand | Presented live 29 June 2023
    Show Abstract + Hide Abstract
    We implement the multimodal characterization for bolted joint surface. A reflection digital holographic microscope (RDHM) has been combined with laser triangulation measurement for achieving quantitative micro-deformation description of the connecting region between bolts and metal plate. The calibrated laser triangulation instrument is installed on the electronically controlled translation platform for two-dimensional scanning. A structure extraction algorithm is proposed to realize fast 3D reconstruction of the scanned area. The profiles obtained by triangulation are used, as ground truth, to calibrate the measured reflective holographic morphology. The triangulation will complement digital holography well.
    12621-39
    Author(s): Rosa Pia Devanna, Sistemi e Tecnologie Industriali Intelligenti per il Manifattuiero Avanzato, Consiglio Nazionale delle Ricerche (Italy); Giulio Reina, Politecnico di Bari (Italy); Annalisa Milella, Sistemi e Tecnologie Industriali Intelligenti per il Manifattuiero Avanzato, Consiglio Nazionale del (Italy)
    On demand | Presented live 29 June 2023
    12621-40
    Author(s): Lars Loetgering, ZEISS Research Microscopy Solutions (Germany)
    29 June 2023 • 16:50 - 17:10 CEST | ICM Room 12a
    Show Abstract + Hide Abstract
    Conventional (CP) and Fourier (FP) ptychography have emerged as versatile quantitative phase imaging techniques. While the main application cases for each technique are different, namely lens-less short wavelength imaging for CP and lens-based visible light imaging for FP, both methods share a common algorithmic ground. CP and FP have in part independently evolved to include experimentally robust forward models and inversion techniques. This separation has resulted in a plethora of algorithmic extensions, some of which have not crossed the boundary from one modality to the other. Here, we describe the principle of reciprocity in ptychography. This principle allows for flexible conversion between CP and FP. Moreover, we present an open source, crossplatform software, called PtyLab, enabling a unified CP and FP data analysis. With this framework, we aim to facilitate and accelerate cross-pollination between the two techniques. Moreover, the availability in Matlab, Python, and Julia will set a low barrier to enter each field.
    12621-41
    Author(s): Stefano Mutti, Sistemi e Tecnologie Industriali Intelligenti per il Manifattuiero Avanzato, Consiglio Nazionale delle Ricerche (Italy); Giovanni Dimauro, Univ. degli Studi di Bari Aldo Moro (Italy); Nicola Pedrocchi, Sistemi e Tecnologie Industriali Intelligenti per il Manifattuiero Avanzato, Consiglio Nazionale delle Ricerche (Italy)
    On demand | Presented live 29 June 2023
    Show Abstract + Hide Abstract
    Many robotics and computer vision scenarios employ fiducial markers as a pose estimation tool to perform closed-loop control or data collection due to their ease of installation and acquisition. Human-robot collaboration and mobile robotics are a few of many examples in which fiducial markers serve as a position estimation tool and where the estimation frequency characterizes the control algorithm employed in terms of performance. The currently available tools are not keeping up with the advancement in acquisition systems, where the raising of image resolution and increasing acquisition rate harden the fiducial marker detection computation. In this work, we propose a method to model and forecast the marker position, leveraging the kinematic properties of the system on which is fixed and estimating its state during the acquisition. The state of the marker, composed of its position and velocity, is updated by an Unscented Kalman Filter(UKF) by using the information extracted from the previous marker estimations and taking into account the kinematic instantaneous configuration. The UKF, knowing the kinematic specification in terms of velocity and positional constraints, will aid the forecast of the marker trajectory. In such a manner, the forecasted position of the marker is used to define a dynamic region of interest that aids the marker localization procedure and improves computation time. The method is then compared to the classical implementation for different fiducial marker systems.
    Conference Chair
    CNR (Italy)
    Conference Co-Chair
    Istituto per il Rilevamento Elettromagnetico dell'Ambiente (Italy)
    Conference Co-Chair
    The Univ. of Warwick (United Kingdom)
    Conference Co-Chair
    Nanyang Technological Univ. (Singapore)
    Program Committee
    Technische Univ. Delft (Netherlands)
    Program Committee
    Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello" (Italy)
    Program Committee
    Institut Fresnel (France)
    Program Committee
    Univ. del Salento (Italy)
    Program Committee
    Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello" (Italy)
    Program Committee
    Liège Univ. (Belgium)
    Program Committee
    Univ. degli Studi di Napoli Federico II (Italy)
    Program Committee
    UFRN (Brazil)
    Program Committee
    Univ. of Bath (United Kingdom)
    Program Committee
    Aalborg Univ. (Denmark)
    Program Committee
    CNR (Italy)
    Program Committee
    Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello" (Italy)
    Program Committee
    Clive Roberts
    The Univ. of Birmingham (United Kingdom)
    Program Committee
    Mines Alès (France)
    Program Committee
    Rocco Zito
    Flinders Univ. (Australia)
    Additional Information

    View call for papers

     

    What you will need to submit

    • Title
    • Author(s) information
    • Speaker biography
    • 250-word abstract for technical review
    • 100-word summary for the program
    • Keywords used in search for your paper (optional)
    Note: Only original material should be submitted. Commercial papers, papers with no new research/development content, and papers with proprietary restrictions will not be accepted for presentation.