Issue Downloads
EffCause: Discover Dynamic Causal Relationships Efficiently from Time-Series
Since the proposal of Granger causality, many researchers have followed the idea and developed extensions to the original algorithm. The classic Granger causality test aims to detect the existence of the static causal relationship. Notably, a fundamental ...
Asymmetric Learning for Graph Neural Network based Link Prediction
Link prediction is a fundamental problem in many graph-based applications, such as protein-protein interaction prediction. Recently, graph neural network (GNN) has been widely used for link prediction. However, existing GNN-based link prediction (GNN-LP) ...
Multi-Task Learning with Sequential Dependence Toward Industrial Applications: A Systematic Formulation
Multi-task learning (MTL) is widely used in the online recommendation and financial services for multi-step conversion estimation, but current works often overlook the sequential dependence among tasks. In particular, sequential dependence multi-task ...
Package Arrival Time Prediction via Knowledge Distillation Graph Neural Network
Accurately estimating packages’ arrival time in e-commerce can enhance users’ shopping experience and improve the placement rate of products. This problem is often formalized as an Origin-Destination (OD)-based ETA (i.e., estimated time of arrival) ...
Attacking Click-through Rate Predictors via Generating Realistic Fake Samples
How to construct imperceptible (realistic) fake samples is critical in adversarial attacks. Due to the sample feature diversity of a recommender system (containing both discrete and continuous features), traditional gradient-based adversarial attack ...
FiFrauD: Unsupervised Financial Fraud Detection in Dynamic Graph Streams
Given a stream of financial transactions between traders in an e-market, how can we accurately detect fraudulent traders and suspicious behaviors in real time? Despite the efforts made in detecting these fraudsters, this field still faces serious ...
Prerequisite-Enhanced Category-Aware Graph Neural Networks for Course Recommendation
The rapid development of Massive Open Online Courses (MOOCs) platforms has created an urgent need for an efficient personalized course recommender system that can assist learners of all backgrounds and levels of knowledge in selecting appropriate courses. ...
Domain Generalization in Time Series Forecasting
Domain generalization aims to design models that can effectively generalize to unseen target domains by learning from observed source domains. Domain generalization poses a significant challenge for time series data, due to varying data distributions and ...
TaSPM: Targeted Sequential Pattern Mining
Sequential pattern mining (SPM) is an important technique in the field of pattern mining, which has many applications in reality. Although many efficient SPM algorithms have been proposed, there are few studies that can focus on targeted tasks. Targeted ...
Networked Time-series Prediction with Incomplete Data via Generative Adversarial Network
A networked time series (NETS) is a family of time series on a given graph, one for each node. It has a wide range of applications from intelligent transportation to environment monitoring to smart grid management. An important task in such applications ...
CoBjeason: Reasoning Covered Object in Image by Multi-Agent Collaboration Based on Informed Knowledge Graph
Object detection is a widely studied problem in existing works. However, in this paper, we turn to a more challenging problem of “Covered Object Reasoning”, aimed at reasoning the category label of target object in the given image particularly when it has ...
X-distribution: Retraceable Power-law Exponent of Complex Networks
Network modeling has been explored extensively by means of theoretical analysis as well as numerical simulations for Network Reconstruction (NR). The network reconstruction problem requires the estimation of the power-law exponent (γ) of a given input ...
Supervised Clustering of Persian Handwritten Images Using Regularization and Dimension Reduction Methods
Clustering, as a fundamental exploratory data technique, not only is used to discover patterns and structures in complex datasets but also is utilized to group variables in high-dimensional data analysis. Dimension reduction through clustering helps ...
Graph Time-series Modeling in Deep Learning: A Survey
Time-series and graphs have been extensively studied for their ubiquitous existence in numerous domains. Both topics have been separately explored in the field of deep learning. For time-series modeling, recurrent neural networks or convolutional neural ...
A Survey on AutoML Methods and Systems for Clustering
Automated Machine Learning (AutoML) aims to identify the best-performing machine learning algorithm along with its input parameters for a given dataset and a specific machine learning task. This is a challenging problem, as the process of finding the best ...
X-FSPMiner: A Novel Algorithm for Frequent Similar Pattern Mining
- Ansel Y. Rodríguez-González,
- Ramón Aranda,
- Miguel Á. Álvarez-Carmona,
- Angel Díaz-Pacheco,
- Rosa María Valdovinos Rosas
Frequent similar pattern mining (FSP mining) allows for finding frequent patterns hidden from the classical approach. However, the use of similarity functions implies more computational effort, necessitating the development of more efficient algorithms ...
Multi-Instance Learning with One Side Label Noise
Multi-instance Learning (MIL) is a popular learning paradigm arising from many real applications. It assigns a label to a set of instances, which is called a bag, and the bag’s label is determined by the instances within it. A bag is positive if and only ...
Math Word Problem Generation via Disentangled Memory Retrieval
The task of math word problem (MWP) generation, which generates an MWP given an equation and relevant topic words, has increasingly attracted researchers’ attention. In this work, we introduce a simple memory retrieval module to search related training ...
On the Value of Head Labels in Multi-Label Text Classification
A formidable challenge in the multi-label text classification (MLTC) context is that the labels often exhibit a long-tailed distribution, which typically prevents deep MLTC models from obtaining satisfactory performance. To alleviate this problem, most ...
Towards Differential Privacy in Sequential Recommendation: A Noisy Graph Neural Network Approach
With increasing frequency of high-profile privacy breaches in various online platforms, users are becoming more concerned about their privacy. And recommender system is the core component of online platforms for providing personalized service, ...
Local Community Detection in Multiple Private Networks
Individuals are often involved in multiple online social networks. Considering that owners of these networks are unwilling to share their networks, some global algorithms combine information from multiple networks to detect all communities in multiple ...
Enhancing Out-of-distribution Generalization on Graphs via Causal Attention Learning
In graph classification, attention- and pooling-based graph neural networks (GNNs) predominate to extract salient features from the input graph and support the prediction. They mostly follow the paradigm of “learning to attend,” which maximizes the mutual ...
Incorporating Multi-Level Sampling with Adaptive Aggregation for Inductive Knowledge Graph Completion
In recent years, Graph Neural Networks (GNNs) have achieved unprecedented success in handling graph-structured data, thereby driving the development of numerous GNN-oriented techniques for inductive knowledge graph completion (KGC). A key limitation of ...
A Taxonomy for Learning with Perturbation and Algorithms
Weighting strategy prevails in machine learning. For example, a common approach in robust machine learning is to exert low weights on samples which are likely to be noisy or quite hard. This study summarizes another less-explored strategy, namely, ...
Generation-based Multi-view Contrast for Self-supervised Graph Representation Learning
Graph contrastive learning has made remarkable achievements in the self-supervised representation learning of graph-structured data. By employing perturbation function (i.e., perturbation on the nodes or edges of graph), most graph contrastive learning ...
Mining Top-k High On-shelf Utility Itemsets Using Novel Threshold Raising Strategies
High utility itemsets (HUIs) mining is an emerging area of data mining which discovers sets of items generating a high profit from transactional datasets. In recent years, several algorithms have been proposed for this task. However, most of them do not ...
Conditional Generative Adversarial Network for Early Classification of Longitudinal Datasets Using an Imputation Approach
Early classification of longitudinal data remains an active area of research today. The complexity of these datasets and the high rates of missing data caused by irregular sampling present data-level challenges for the Early Longitudinal Data ...
Scalable and Inductive Semi-supervised Classifier with Sample Weighting Based on Graph Topology
Recently, graph-based semi-supervised learning (GSSL) has garnered significant interest in the realms of machine learning and pattern recognition. Although some of the proposed methods have made some progress, there are still some shortcomings that need ...