This book constitutes the referred proceedings of the Second International Conference on Pattern Analysis and Machine Intelligence, ICPAMI 2025, held in Dalian, China, during August 29–31, 2025.
The 23 papers presented here were carefully reviewed and selected from 51 submissions. Their insightful talks covered the applications, challenges, and future directions of Multimodal Foundation and Large Language Models, the frontiers and perspectives in 3D Spatial Video Reconstruction, and research on defenses and out-of-distribution detection in trustworthy deep learning.
Table of Contents:
.- Research on the Innovative Approaches for Empowering the Inheritance and Development of Intangible Cultural Heritage by Smart Digital Restoration Technology.
.- Leveraging a Cross-supervision SAM for Weakly-Supervised Camouflaged Object Detection.
.- CSSeg: Cross-Supervision for Unsupervised Semantic Segmentation.
.- Optimization Path for Chinese Portrait Generation with AIGC via SAM Segmentation and LoRA Fine-Tuning.
.- Improve the joint image encryption echnology of JTC and FRFT.
.- Multiple Group Detection and Tracking for Multiple Domain Formation.
.- DSF-DETR:Real-Time End-to-End Object Detection with Multi-Scale Fusion and Lightweight Activation.
.- Online Multi-target Pedestrian Tracking Based on Spatiotemporal Estimation.
.- Research on Multi-Modal Remote Sensing Image Object Detection Based on YOLOv11-AM.
.- SGCC: A knowledge graph construction framework for incomplete schemas.
.- FRAN: Multi-Scale Frequency–Spatial Residual Attention Network for General-Purpose AIGC Image Detection.
.- HIA: Hybrid Interactive Attention for Efficient Remote Sensing Image Super-Resolution.
.- OVIF-YOLO: A Multispectral Object Detection Algorithm for Visible-Infrared Image Fusion.
.- Distributed Weighted Mahalanobis Distance Spectral Clustering on Spark for Large-Scale Data.
.- A Deep Learning and Intelligent Vision-Based Approach for Dental Health Detection.
.- Automatic Sleep Staging via Multi-Modal Graph Learning and Cross-Graph Fusion.
.- MAIC-DF: A Multimodal AI-Powered Career Development Framework with Dynamic Scenario Generation and Proactive Guidance.
.- Risk Stage Embedding: Unsupervised Representation Learning for Enhanced Heart Disease Prediction.
.- Advancing Archaeological Ceramic Image Classification via Few-Shot Learning.
.- A method for acquiring X-ray pulsar observation profiles based on the Blackman-Tukey method.
.- Beyond KLD: A Symmetric Statistical Loss for Distribution-Aligned Detection.
.- Identifying Critical Nodes in Heterogeneous Networks for IoT Systems Based on Global and Local Centrality.
.- A Review of the Research Progress in Footprint Biometric Recognition.