Idiap Researchers Take Physics-inspired Approach to Face Generation

New research from the Idiap Research Institute’s Biometrics Security and Privacy group offers a novel and effective approach to synthetic face dataset generation that could help to address the challenges of privacy, data diversity, and model performance in the field of face recognition.

The paper, “Synthetic Face Datasets Generation via Latent Space Exploration from Brownian Identity Diffusion”, by David Geissbühler, Hatef Otroshi Shahreza, and Sébastien Marcel, proposes a dataset generation method inspired by the physical motion of particles under Brownian forces, sampling identities in latent space under various constraints. The method aims to generate synthetic face datasets that perform on par with or better than existing GAN-based and diffusion-based synthetic datasets.

To achieve this, the authors present the Langevin algorithm, which iterates over a random initial set of identities, treating samples as soft spherical particles subject to repulsive forces, a random Brownian force, and a global attractive potential.

The Langevin algorithm is rooted in the Langevin equation, a stochastic differential equation (SDE) used to describe the Brownian motion of particles. In the context of making face datasets for facial recognition training, the identities are represented as particles in a multi-dimensional latent space, and the algorithm applies repulsive forces to ensure that these identities are spread out sufficiently. Simultaneously, an attractive force pulls the identities towards a central point in the latent space to maintain high image quality. This balance helps achieve dense packing of identities in latent space, optimizing for realistic image generation.

The method uses a loss function inspired by granular mechanics, where the potential energy between particles depends on their overlap, leading to repulsive forces that drive them apart if they are too close. The method also introduces a random force component to prevent the identities from getting stuck in local minima, simulating the random collisions experienced by particles in Brownian motion.
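As a rough illustration of the three forces described above, the following Python sketch performs one update on a set of particles. It is a toy model and not the authors' implementation: the harmonic overlap potential, the pull toward the origin, and every name and parameter value (`langevin_step`, `radius`, `k_rep`, `k_att`, `noise`, `step`) are assumptions chosen for readability.

```python
import math
import random

def langevin_step(points, radius=1.0, k_rep=1.0, k_att=0.1,
                  noise=0.05, step=0.1, rng=random):
    """One toy update: soft-sphere repulsion, pull to origin, random kick."""
    dim = len(points[0])
    # Global attractive potential: a spring pulling every particle to the origin.
    forces = [[-k_att * x for x in p] for p in points]
    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            diff = [a - b for a, b in zip(points[i], points[j])]
            dist = math.sqrt(sum(c * c for c in diff))
            overlap = 2 * radius - dist
            if overlap > 0 and dist > 0:
                # Assumed potential 0.5 * k_rep * overlap**2, so the
                # repulsive force grows linearly with the overlap.
                for d in range(dim):
                    push = k_rep * overlap * diff[d] / dist
                    forces[i][d] += push
                    forces[j][d] -= push
    # Deterministic drift plus a Gaussian kick standing in for the Brownian force.
    return [
        [x + step * f + noise * math.sqrt(step) * rng.gauss(0.0, 1.0)
         for x, f in zip(p, fs)]
        for p, fs in zip(points, forces)
    ]
```

Calling `langevin_step` repeatedly on a random initial set pushes overlapping particles apart while the attraction term keeps them from drifting away. Setting `noise=0.0` makes the update deterministic, which is convenient for checking the force terms.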

The overall effect is that the identities are distributed throughout the latent space, maximizing their diversity.

Beyond the Langevin algorithm, the authors develop two additional algorithms, Dispersion and DisCo, to generate intra-class variations, ensuring diversity within each identity class. Dispersion focuses on generating multiple variations of a single identity by sampling around a reference latent vector and optimizing these samples to remain close in embedding space while still varying in appearance. DisCo combines the principles of Dispersion with covariate adjustments, such as pose or lighting changes, to further enhance the diversity of the generated faces.
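The idea behind Dispersion, keeping samples close in embedding space while letting them spread out in latent space, can be sketched with a simple constrained random walk. This is a stand-in technique, not the optimization used in the paper: `toy_embed` is a hypothetical placeholder for a face recognition embedding, and `disperse`, `sigma`, and `max_embed_dist` are illustrative names and values.

```python
import math
import random

def toy_embed(latent):
    # Hypothetical stand-in for a face recognition embedding:
    # only the first two coordinates are treated as "identity".
    return latent[:2]

def disperse(reference, n_samples=5, sigma=0.05, max_embed_dist=0.2,
             iters=200, seed=0, embed=toy_embed):
    """Spread samples around a reference latent under an embedding constraint."""
    rng = random.Random(seed)
    ref_emb = embed(reference)
    samples = [list(reference) for _ in range(n_samples)]
    for _ in range(iters):
        i = rng.randrange(n_samples)
        proposal = [x + rng.gauss(0.0, sigma) for x in samples[i]]
        # Constraint: stay close to the reference identity in embedding space.
        if math.dist(embed(proposal), ref_emb) > max_embed_dist:
            continue
        others = [s for j, s in enumerate(samples) if j != i]
        old = min(math.dist(samples[i], s) for s in others)
        new = min(math.dist(proposal, s) for s in others)
        # Objective: accept moves that do not bring the sample
        # closer to its nearest sibling.
        if new >= old:
            samples[i] = proposal
    return samples
```

In this toy setup the non-identity coordinates are free to vary, which loosely mirrors appearance changes that leave the identity embedding nearly unchanged. A real pipeline would replace `toy_embed` with an actual face recognition model applied to generated images.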

The paper demonstrates the effectiveness of these algorithms by benchmarking synthetic datasets against real-world datasets. The results show that models trained on data generated by the Langevin method perform better than those trained on previous GAN-based datasets and achieve competitive results with state-of-the-art diffusion-based datasets. The generated datasets also help mitigate privacy issues by preventing leakage from the generator’s training set, which is a significant concern with traditional datasets.

Applying this method to facial recognition, the authors explore various parameters that influence the quality and performance of the synthetic datasets. They find that adjusting the repulsion distance thresholds and the number of Langevin iterations significantly impacts the accuracy of the face recognition (FR) model. For example, increasing the number of iterations allows the identities to better optimize their positions in latent space, resulting in more diverse and realistic face images. The research also highlights the computational efficiency of the Langevin method compared to traditional random-reject sampling, providing a scalable solution for large dataset generation.
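For context on the baseline mentioned above, random-reject sampling can be sketched as follows: draw a candidate at random and keep it only if it is far enough from everything accepted so far. The uniform box, the Euclidean distance, and the names `reject_sample`, `min_dist`, and `max_draws` are assumptions for illustration; the paper works in a generator's latent space rather than a simple box.

```python
import math
import random

def reject_sample(n_points, min_dist, dim=2, max_draws=100_000, seed=0):
    """Toy random-reject sampler; returns accepted points and draws used."""
    rng = random.Random(seed)
    accepted, draws = [], 0
    while len(accepted) < n_points and draws < max_draws:
        draws += 1
        candidate = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
        # Reject any candidate that lands too close to an accepted point.
        if all(math.dist(candidate, p) >= min_dist for p in accepted):
            accepted.append(candidate)
    return accepted, draws
```

As more points are accepted, a growing share of candidates is rejected, so the number of draws per new point rises. This is the kind of cost that an iterative, force-based method avoids by moving existing samples instead of discarding them.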

This physics-inspired research opens new avenues for leveraging synthetic data in machine learning while ensuring ethical and privacy considerations are met. The full paper is available through arXiv.

Source: arXiv

May 15, 2024 – by Cass Kennedy
