Volume 15, Issue 3, June 2024
Editor:
  • Huan Liu
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
ISSN: 2157-6904
EISSN: 2157-6912
SECTION: Survey Papers
survey
A Survey on Evaluation of Large Language Models
Article No.: 39, Pages 1–45, https://doi.org/10.1145/3641289

Large language models (LLMs) are gaining increasing popularity in both academia and industry, owing to their unprecedented performance in various applications. As LLMs continue to play a vital role in both research and daily use, their evaluation becomes ...

survey
Deep Learning in Single-cell Analysis
Article No.: 40, Pages 1–62, https://doi.org/10.1145/3641284

Single-cell technologies are revolutionizing the entire field of biology. The large volumes of data generated by single-cell technologies are high dimensional, sparse, and heterogeneous and have complicated dependency structures, making analyses using ...

SECTION: Regular Papers
research-article
Open Access
MGRR-Net: Multi-level Graph Relational Reasoning Network for Facial Action Unit Detection
Article No.: 41, Pages 1–20, https://doi.org/10.1145/3643863

The Facial Action Coding System (FACS) encodes the action units (AUs) in facial images, which has attracted extensive research attention due to its wide use in facial expression analysis. Many methods that perform well on automatic facial action unit (AU) ...

research-article
Bayesian Strategy Networks Based Soft Actor-Critic Learning
Article No.: 42, Pages 1–24, https://doi.org/10.1145/3643862

A strategy refers to the rules by which an agent chooses among the available actions to achieve its goals. Adopting reasonable strategies is challenging but crucial for an intelligent agent with limited resources working in hazardous, unstructured, and dynamic ...

research-article
Internal Rehearsals for a Reconfigurable Robot to Improve Area Coverage Performance
Article No.: 43, Pages 1–17, https://doi.org/10.1145/3643854

Reconfigurable robots are deployed for applications demanding area coverage, such as cleaning and inspections. Reconfiguration per context, considering beyond a small set of predefined shapes, is crucial for area coverage performance. However, the ...

research-article
Guidelines for the Regularization of Gammas in Batch Normalization for Deep Residual Networks
Article No.: 44, Pages 1–20, https://doi.org/10.1145/3643860

L2 regularization for weights in neural networks is widely used as a standard training trick. In addition to weights, the use of batch normalization involves an additional trainable parameter γ, which acts as a scaling factor. However, L2 regularization ...

research-article
Multimodal Dialogue Systems via Capturing Context-aware Dependencies and Ordinal Information of Semantic Elements
Article No.: 45, Pages 1–25, https://doi.org/10.1145/3645099

The topic of multimodal conversation systems has recently garnered significant attention across various industries, including travel and retail, among others. While pioneering works in this field have shown promising performance, they often focus solely ...

research-article
Open Access
CACTUS: A Comprehensive Abstraction and Classification Tool for Uncovering Structures
Article No.: 46, Pages 1–23, https://doi.org/10.1145/3649459

The availability of large datasets is providing the impetus for driving many current artificial intelligence developments. However, specific challenges arise in developing solutions that exploit small datasets, mainly due to practical and cost-effective ...

research-article
Advancing Attribution-Based Neural Network Explainability through Relative Absolute Magnitude Layer-Wise Relevance Propagation and Multi-Component Evaluation
Article No.: 47, Pages 1–30, https://doi.org/10.1145/3649458

Recent advances in deep neural network performance have led to the development of new state-of-the-art approaches in numerous areas. However, the black-box nature of neural networks often prohibits their use in areas where model explainability and model ...

research-article
Learning Cross-modality Interaction for Robust Depth Perception of Autonomous Driving
Article No.: 48, Pages 1–26, https://doi.org/10.1145/3650039

As one of the fundamental tasks of autonomous driving, depth perception aims to perceive physical objects in three dimensions and to judge their distances away from the ego vehicle. Although great efforts have been made for depth perception, LiDAR-based ...

research-article
Tapestry of Time and Actions: Modeling Human Activity Sequences Using Temporal Point Process Flows
Article No.: 49, Pages 1–27, https://doi.org/10.1145/3650045

Human beings always engage in a vast range of activities and tasks that demonstrate their ability to adapt to different scenarios. These activities can range from the simplest daily routines, like walking and sitting, to multi-level complex endeavors such ...

research-article
Deconfounded Cross-modal Matching for Content-based Micro-video Background Music Recommendation
Article No.: 50, Pages 1–25, https://doi.org/10.1145/3650042

Object-oriented micro-video background music recommendation is a complicated task where the matching degree between videos and background music is a major issue. However, music selections in user-generated content (UGC) are prone to selection bias caused ...

research-article
MHGCN+: Multiplex Heterogeneous Graph Convolutional Network
Article No.: 51, Pages 1–25, https://doi.org/10.1145/3650046

Heterogeneous graph convolutional networks have gained great popularity in tackling various network analytical tasks on heterogeneous graph data, ranging from link prediction to node classification. However, most existing works ignore the relation ...

research-article
A Game-theoretic Framework for Privacy-preserving Federated Learning
Article No.: 52, Pages 1–35, https://doi.org/10.1145/3656049

In federated learning, benign participants aim to optimize a global model collaboratively. However, the risk of privacy leakage cannot be ignored in the presence of semi-honest adversaries. Existing research has focused either on designing protection ...

research-article
Self-supervised Bipartite Graph Representation Learning: A Dirichlet Max-margin Matrix Factorization Approach
Article No.: 53, Pages 1–24, https://doi.org/10.1145/3645098

Bipartite graph representation learning aims to obtain node embeddings by compressing sparse vectorized representations of interactions between two types of nodes, e.g., users and items. Incorporating structural attributes among homogeneous nodes, such as ...

research-article
Empowering Predictive Modeling by GAN-based Causal Information Learning
Article No.: 54, Pages 1–19, https://doi.org/10.1145/3652610

Generally speaking, we can easily specify many causal relationships in the prediction tasks of ubiquitous computing, such as human activity prediction, mobility prediction, and health prediction. However, most of the existing methods in these fields ...

research-article
A Meta-Learning Framework for Tuning Parameters of Protection Mechanisms in Trustworthy Federated Learning
Article No.: 55, Pages 1–36, https://doi.org/10.1145/3652612

Trustworthy federated learning typically leverages protection mechanisms to guarantee privacy. However, protection mechanisms inevitably introduce utility loss or efficiency reduction while protecting data privacy. Therefore, protection mechanisms and ...

research-article
Open Access
Ensuring Fairness and Gradient Privacy in Personalized Heterogeneous Federated Learning
Article No.: 56, Pages 1–30, https://doi.org/10.1145/3652613

With the increasing tension between conflicting requirements of the availability of large amounts of data for effective machine learning-based analysis, and for ensuring their privacy, the paradigm of federated learning has emerged, a distributed machine ...

research-article
Open Access
FedCMD: A Federated Cross-modal Knowledge Distillation for Drivers’ Emotion Recognition
Article No.: 57, Pages 1–27, https://doi.org/10.1145/3650040

Emotion recognition has attracted a lot of interest in recent years in various application areas such as healthcare and autonomous driving. Existing approaches to emotion recognition are based on visual, speech, or psychophysiological signals. However, ...

research-article
Perceiving Actions via Temporal Video Frame Pairs
Article No.: 58, Pages 1–20, https://doi.org/10.1145/3652611

Video action recognition aims at classifying the action category in given videos. In general, semantic-relevant video frame pairs reflect significant action patterns such as object appearance variation and abstract temporal concepts like speed, rhythm, ...

research-article
Open Access
Score-based Graph Learning for Urban Flow Prediction
Article No.: 59, Pages 1–25, https://doi.org/10.1145/3655629

Accurate urban flow prediction (UFP) is crucial for a range of smart city applications such as traffic management, urban planning, and risk assessment. To capture the intrinsic characteristics of urban flow, recent efforts have utilized spatial and ...

research-article
Open Access
HydraGAN: A Cooperative Agent Model for Multi-Objective Data Generation
Article No.: 60, Pages 1–21, https://doi.org/10.1145/3653982

Generative adversarial networks have become a de facto approach to generate synthetic data points that resemble their real counterparts. We tackle the situation where the realism of individual samples is not the sole criterion for synthetic data ...

SECTION: Best of Wisdom 2022
research-article
Quintuple-based Representation Learning for Bipartite Heterogeneous Networks
Article No.: 61, Pages 1–19, https://doi.org/10.1145/3653978

Recent years have seen rapid progress in network representation learning, which removes the need for burdensome feature engineering and facilitates downstream network-based tasks. In reality, networks often exhibit heterogeneity, which means there may ...

research-article
Open Access
Analysing Utterances in LLM-Based User Simulation for Conversational Search
Article No.: 62, Pages 1–22, https://doi.org/10.1145/3650041

Clarifying underlying user information needs by asking clarifying questions is an important feature of modern conversational search systems. However, evaluation of such systems through answering prompted clarifying questions requires significant human ...
