Robust Image Descriptor for Machine Learning based Data Reduction in Crystallography

19 Sept 2022, 17:15
15m
Seminarraum 1-3 (CFEL (Building 99))

Seminarraum 1-3

CFEL (Building 99)

Luruper Chaussee 149 22761 Hamburg Germany
Contributed Talk (15 min) Nano and Material Science Nano and Material Science

Speaker

Dr Vahid Rahmani (DESY)

Description

Recent serial crystallography experiments at FELs produce a large amount of data, where typically the ratio of useful images containing crystal diffraction (hit fraction) is about 5-10% but hit fractions even lower than 0.1 % have been observed in some experiments. Demands on data storage could be greatly reduced by rejecting bad images before saving them to disk, but this requires reliable methods for detecting these images that do not rely on expert tuning or intervention during the experiment. Traditional Non-Machine learning techniques successfully classify good and bad images based on number of detected peak on the images. But those techniques require tuning many parameters that are vary in each experiment. Also traditional machine learning methods like artificial neural networks successfully classify diffraction patterns, but the major challenge is cross-domain performance, in which a classifier trained by a set of local feature vectors extracted on one dataset cannot necessarily be applied to data collected with different samples and experimental settings. Scale-invariance, good localization, and robustness to noise and artifacts are the main properties that a local feature detector should possess. In this paper, we propose a real-time, automatic, and parameter-free image descriptor as a local feature detector for use as input data for ML models. Our method describes each diffraction pattern by a vector, consisting of the number of keypoints (Bragg spots) in different areas of the image. Additionally, we parallelized our novel and parameter-free keypoint detection algorithm to work computationally efficiently compared to the traditional keypoint detection algorithms. A machine learning model trained by this introduced image descriptor(vector) performs well across different experimental settings(cross-domain) to classify diffraction patterns as hit and miss diffraction patterns. Our initial experimental results show a significant improvement in decreasing the domain gap when a ML classifier is trained by new image descriptor and tested by another unseen dataset.

Primary author

Dr Vahid Rahmani (DESY)

Co-authors

Dr Shah Nawaz (DESY) Mr Shabarish Pala Ramakantha Setty (DESY) Mr Jaisinh Bhosale (Technical University of Hamburg) Dr David Pennicard (DESY) Prof. Heinz Graafsma (DESY)

Presentation materials

There are no materials yet.