15 results on '"Yunjun Gao"'
Search Results
2. MTTPRE
- Author
-
Feng Wan, Linsen Li, Ke Wang, Lu Chen, Yunjun Gao, Weihao Jiang, and Shiliang Pu
- Published
- 2022
- Full Text
- View/download PDF
3. ZeroMatcher
- Author
-
Congcong Ge, Xiaocan Zeng, Lu Chen, and Yunjun Gao
- Published
- 2022
- Full Text
- View/download PDF
4. Self-Guided Learning to Denoise for Robust Recommendation
- Author
-
Yunjun Gao, Yuntao Du, Yujia Hu, Lu Chen, Xinjun Zhu, Ziquan Fang, and Baihua Zheng
- Subjects
FOS: Computer and information sciences ,Information Retrieval (cs.IR) ,Computer Science - Information Retrieval - Abstract
The ubiquity of implicit feedback makes them the default choice to build modern recommender systems. Generally speaking, observed interactions are considered as positive samples, while unobserved interactions are considered as negative ones. However, implicit feedback is inherently noisy because of the ubiquitous presence of noisy-positive and noisy-negative interactions. Recently, some studies have noticed the importance of denoising implicit feedback for recommendations, and enhanced the robustness of recommendation models to some extent. Nonetheless, they typically fail to (1) capture the hard yet clean interactions for learning comprehensive user preference, and (2) provide a universal denoising solution that can be applied to various kinds of recommendation models. In this paper, we thoroughly investigate the memorization effect of recommendation models, and propose a new denoising paradigm, i.e., Self-Guided Denoising Learning (SGDL), which is able to collect memorized interactions at the early stage of the training (i.e., "noise-resistant" period), and leverage those data as denoising signals to guide the following training (i.e., "noise-sensitive" period) of the model in a meta-learning manner. Besides, our method can automatically switch its learning phase at the memorization point from memorization to self-guided learning, and select clean and informative memorized data via a novel adaptive denoising scheduler to improve the robustness. We incorporate SGDL with four representative recommendation models (i.e., NeuMF, CDAE, NGCF and LightGCN) and different loss functions (i.e., binary cross-entropy and BPR loss). The experimental results on three benchmark datasets demonstrate the effectiveness of SGDL over the state-of-the-art denoising methods like T-CE, IR, DeCA, and even state-of-the-art robust graph-based methods like SGCN and SGL., Accepted by SIGIR2022
- Published
- 2022
- Full Text
- View/download PDF
5. Make It Easy
- Author
-
Lu Chen, Yunjun Gao, Congcong Ge, Xiaoze Liu, and Baihua Zheng
- Subjects
Structure (mathematical logic) ,End-to-end principle ,Computer science ,Process (computing) ,Fuse (electrical) ,Graph (abstract data type) ,Competitor analysis ,Data mining ,computer.software_genre ,computer ,Task (project management) ,Utilization - Abstract
Entity alignment (EA) is a prerequisite for enlarging the coverage of a unified knowledge graph. Previous EA approaches either restrain the performance due to inadequate information utilization or need labor-intensive pre-processing to get external or reliable information to perform the EA task. This paper proposes EASY, an effective end-to-end EA framework, which is able to (i) remove the labor-intensive pre-processing by fully discovering the name information provided by the entities themselves; and (ii) jointly fuse the features captured by the names of entities and the structural information of the graph to improve the EA results. Specifically, EASY first introduces NEAP, a highly effective name-based entity alignment procedure, to obtain an initial alignment that has reasonable accuracy and meanwhile does not require much memory consumption or any complex training process. Then, EASY invokes SRS, a novel structure-based refinement strategy, to iteratively correct the misaligned entities generated by NEAP to further enhance the entity alignment. Extensive experiments demonstrate the superiority of our proposed EASY with significant improvement against 13 existing state-of-the-art competitors.
- Published
- 2021
- Full Text
- View/download PDF
6. NAD
- Author
-
Sai Wu, Yunjun Gao, Gang Chen, Zhifei Pang, and Dongxiang Zhang
- Subjects
Engineering drawing ,Textile ,Artificial neural network ,Computer science ,business.industry ,Process (engineering) ,Design process ,Reinforcement learning ,Textile (markup language) ,business ,Task (project management) - Abstract
Textile pattern design is a challenging task that can be hardly resolved by a single deep neural network, due to the requirements on high resolution, periodic tiling, copyright protection and aesthetic preference of designers. In this paper, we present our NAD system which can automatically produce high-quality textile patterns for printing industry. Our NAD system splits the work into three steps: layout design, image filtering and pattern style transfer. In the first and last step, we employ different neural models to learn the process of artwork creation by human designers. Specifically, a reinforcement learning model is first developed for layout adjustment, followed by a CNN-based model for style transfer. We have employed our NAD system in an online production system with real customers and the results are very impressive and promising. The NAD system not only frees human designers from the labor intensive design process, but also results in a 2%-5% daily purchase rate.
- Published
- 2019
- Full Text
- View/download PDF
7. GetReal
- Author
-
Hui Li, Jianfeng Ma, Yunjun Gao, Sourav S. Bhowmick, and Jiangtao Cui
- Subjects
Mathematical optimization ,Group (mathematics) ,Computer science ,business.industry ,Maximization ,Competitor analysis ,Competition (economics) ,symbols.namesake ,Nash equilibrium ,symbols ,Artificial intelligence ,business ,Game theory ,Selection (genetic algorithm) - Abstract
State-of-the-art classical influence maximization (IM) techniques are "competition-unaware" as they assume that a group (company) finds seeds (users) in a network independent of other groups who are also simultaneously interested in finding such seeds in the same network. However, in reality several groups often compete for the same market (e.g., Samsung, HTC, and Apple for the smart phone market) and hence may attempt to select seeds in the same network. This has led to increasing body of research in devising IM techniques for competitive networks. Despite the considerable progress made by these efforts toward finding seeds in a more realistic settings, unfortunately, they still make several unrealistic assumptions (e.g., a new company being aware of a rival's strategy, alternate seed selection, etc.) making their deployment impractical in real-world networks. In this paper, we propose a novel framework based on game theory to provide a more realistic solution to the IM problem in competitive networks by jettisoning these unrealistic assumptions. Specifically, we seek to find the "best" IM strategy (an algorithm or a mixture of algorithms) a group should adopt in the presence of rivals so that it can maximize its influence. As each group adopts some strategy, we model the problem as a game with each group as competitors and the expected influences under the strategies as payoffs. We propose a novel algorithm called GetReal to find each group's best solution by leveraging the competition between different groups. Specifically, it seeks to find whether there exist a Nash Equilibrium (NE) in a game, which guarantees that there exist an "optimal" strategy for each group. Our experimental study on real-world networks demonstrates the superiority of our solution in a more realistic environment.
- Published
- 2015
- Full Text
- View/download PDF
8. Mapping queries to questions
- Author
-
Rui Li, Gang Chen, Lu Chen, and Yunjun Gao
- Subjects
Search engine ,Information retrieval ,Ranking ,Computer science ,Order (business) ,InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL ,Question answering ,Information needs ,computer.software_genre ,computer ,Data integration ,Ranking (information retrieval) - Abstract
In this paper, for the first time, we study the problem of mapping keyword queries to questions on community-based question answering (CQA) sites. Mapping general web queries to questions enables search engines not only to discover explicit and specific information needs (questions) behind keywords queries, but also to find high quality information (answers) for answering keyword queries. In order to map queries to questions, we propose a ranking algorithm containing three steps: Candidate Question Selection, Candidate Question Ranking, and Candidate Question Grouping. Preliminary experimental results using 60 queries from search logs of a commercial engine show that the presented approach can efficiently find the questions which capture user's information needs explicitly.
- Published
- 2013
- Full Text
- View/download PDF
9. Commodity query by snapping
- Author
-
Kevin Chiew, Qinming He, Hao Huang, Lu Chen, and Yunjun Gao
- Subjects
World Wide Web ,Knowledge base ,Computer science ,business.industry ,Pattern recognition (psychology) ,Commodity ,Linked data ,business ,Mobile device ,Task (project management) - Abstract
Commodity information such as prices and public reviews is always the concern of consumers. Helping them conveniently acquire these information as an instant reference is often of practical significance for their purchase activities. Nowadays, Web 2.0, linked data clouds, and the pervasiveness of smart hand held devices have created opportunities for this demand, i.e., users could just snap a photo of any commodity that is of interest at anytime and anywhere, and retrieve the relevant information via their Internet-linked mobile devices. Nonetheless, compared with the traditional keyword-based information retrieval, extracting the hidden information related to the commodities in photos is a much more complicated and challenging task, involving techniques such as pattern recognition, knowledge base construction, semantic comprehension, and statistic deduction. In this paper, we propose a framework to address this issue by leveraging on various techniques, and evaluate the effectiveness and efficiency of this framework with experiments on a prototype.
- Published
- 2013
- Full Text
- View/download PDF
10. Browse with a social web directory
- Author
-
Lu Chen, Yunjun Gao, Kevin Chiew, Qinming He, Hao Huang, and Rui Li
- Subjects
Web standards ,Web analytics ,Web development ,Web 2.0 ,Computer science ,InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL ,computer.software_genre ,Social web ,Social Semantic Web ,World Wide Web ,Web page ,Web design ,Web navigation ,Semantic Web Stack ,Data Web ,Information retrieval ,business.industry ,Static web page ,Web application security ,Web mining ,Web search engine ,Web mapping ,Web service ,business ,computer ,Site map - Abstract
Browse with either web directories or social bookmarks is an important complementation to search by keywords in web information retrieval. To improve users' browse experiences and facilitate the web directory construction, in this paper, we propose a novel browse system called Social Web Directory (SWD for short) by integrating web directories and social bookmarks. In SWD, (1) web pages are automatically categorized to a hierarchical structure to be retrieved efficiently, and (2) the popular web pages, hottest tags, and expert users in each category are ranked to help users find information more conveniently. Extensive experimental results demonstrate the effectiveness of our SWD system.
- Published
- 2013
- Full Text
- View/download PDF
11. On efficient obstructed reverse nearest neighbor query processing
- Author
-
Chun Chen, Jiacheng Yang, Yunjun Gao, Baihua Zheng, and Gang Chen
- Subjects
Best bin first ,Nearest neighbor graph ,Computer science ,Nearest-neighbor chain algorithm ,Nearest neighbor search ,Shortest path problem ,Data mining ,Fixed-radius near neighbors ,computer.software_genre ,computer ,Algorithm ,Large margin nearest neighbor ,k-nearest neighbors algorithm - Abstract
In this paper, we study a new form of reverse nearest neighbor (RNN) queries, i.e., obstructed reverse nearest neighbor (ORNN) search. It considers the impact of obstacles on the distance between objects, which is ignored by the existing work on RNN retrieval. Given a data set P, an obstacle set O, and a query point q in a 2D space, an ORNN query finds all the points/objects in P that have q as their nearest neighbor, according to the obstructed distance metric, i.e., the length of the shortest path between two points without crossing any obstacle. We formalize ORNN search, develop effective pruning heuristics (via introducing a novel boundary region concept), and propose efficient algorithms for ORNN query processing, assuming that both P and O are indexed by traditional data-partitioning indexes (e.g., R-trees). Extensive experiments demonstrate the effectiveness of our developed pruning heuristics and the performance of our proposed algorithms, using both real and synthetic datasets.
- Published
- 2011
- Full Text
- View/download PDF
12. UPS
- Author
-
Gang Chen, Lidan Shou, Yunjun Gao, Ke Chen, and He Bai
- Subjects
Service (systems architecture) ,Information retrieval ,User profile ,Generalization ,business.industry ,Computer science ,media_common.quotation_subject ,Personalization ,The Internet ,Quality (business) ,Metric (unit) ,business ,Private information retrieval ,media_common - Abstract
In recent years, personalized web search (PWS) has demonstrated effectiveness in improving the quality of search service on the Internet. Unfortunately, the need for collecting private information in PWS has become a major barrier for its wide proliferation. We study privacy protection in PWS engines which capture personalities in user profiles. We propose a PWS framework called UPS that can generalize profiles in for each query according to user-specified privacy requirements. Two predictive metrics are proposed to evaluate the privacy breach risk and the query utility for hierarchical user profile. We develop two simple but effective generalization algorithms for user profiles allowing for query-level customization using our proposed metrics. We also provide an online prediction mechanism based on query utility for deciding whether to personalize a query in UPS. Extensive experiments demonstrate the efficiency and effectiveness of our framework.
- Published
- 2011
- Full Text
- View/download PDF
13. Direction-based spatial skylines
- Author
-
Yoshiharu Ishikawa, Yunjun Gao, and Xi Guo
- Subjects
Spatial query ,Skyline ,Computer science ,Location-based service ,InformationSystems_DATABASEMANAGEMENT ,Snapshot (computer storage) ,Object-based spatial database ,Data mining ,computer.software_genre ,computer - Abstract
Traditional location-based services recommend nearest objects to the user by considering their spatial proximity. However, an object not only has its distance but also has its direction which originates from the user to it. In this paper, we study direction-based spatial skyline queries (DSS queries) which retrieve nearest objects around the user from different directions. The closer object is better than or dominates the further object if they are in the same direction. The objects that cannot be dominated by any other object are included in the direction-based spatial skyline (DSS). We propose algorithms to answer snapshot queries which find objects on the DSS according to the user's current position. We also develop algorithms to support continuous queries which retrieve objects on the DSS while the user is moving linearly. Extensive experiments verify the performance of our proposed algorithms using both real and synthetic datasets.
- Published
- 2010
- Full Text
- View/download PDF
14. Continuous obstructed nearest neighbor queries in spatial databases
- Author
-
Yunjun Gao and Baihua Zheng
- Subjects
Cover tree ,Computer science ,Nearest neighbor search ,Spatial database ,computer.software_genre ,k-nearest neighbors algorithm ,Best bin first ,Nearest neighbor graph ,Nearest-neighbor chain algorithm ,R-tree ,Ball tree ,Shortest path problem ,Data mining ,Fixed-radius near neighbors ,computer ,Algorithm ,Large margin nearest neighbor - Abstract
In this paper, we study a novel form of continuous nearest neighbor queries in the presence of obstacles, namely continuous obstructed nearest neighbor (CONN) search. It considers the impact of obstacles on the distance between objects, which is ignored by most of spatial queries. Given a data set P, an obstacle set O, and a query line segment q in a two-dimensional space, a CONN query retrieves the nearest neighbor of each point on q according to the obstructed distance, i.e., the shortest path between them without crossing any obstacle. We formulate CONN search, analyze its unique properties, and develop algorithms for exact CONN query processing, assuming that both P and O are indexed by conventional data-partitioning indices (e.g., R-trees). Our methods tackle the CONN retrieval by performing a single query for the entire query segment, and only process the data points and obstacles relevant to the final result, via a novel concept of control points and an efficient quadratic-based split point computation algorithm. In addition, we extend our solution to handle the continuous obstructed k-nearest neighbor (COkNN) search, which finds the k (≥1)nearest neighbors to every point along q based on obstructed distances. A comprehensive experimental evaluation using both real and synthetic datasets has been conducted to demonstrate the efficiency and effectiveness of our proposed algorithms.
- Published
- 2009
- Full Text
- View/download PDF
15. Continuous visible nearest neighbor queries
- Author
-
Wang-Chien Lee, Gencai Chen, Baihua Zheng, and Yunjun Gao
- Subjects
Computer science ,business.industry ,Nearest neighbor search ,Pattern recognition ,Interval (mathematics) ,k-nearest neighbors algorithm ,Set (abstract data type) ,Line segment ,Nearest neighbor graph ,Artificial intelligence ,Tuple ,Fixed-radius near neighbors ,business ,Algorithm - Abstract
In this paper, we identify and solve a new type of spatial queries, called continuous visible nearest neighbor (CVNN) search. Given a data set P, an obstacle set O, and a query line segment q, a CVNN query returns a set of (p, R) tuples such that p e P is the nearest neighbor (NN) to every point r along the interval R e q as well as p is visible to r. Note that p may be NULL, meaning that all points in P are invisible to all points in R, due to the obstruction of some obstacles in O. In this paper, we formulate the problem and propose efficient algorithms for CVNN query processing, assuming that both P and O are indexed by R-trees. In addition, we extend our techniques to several variations of the CVNN query. Extensive experiments verify the efficiency and effectiveness of our proposed algorithms using both real and synthetic datasets.
- Published
- 2009
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.