98 results on '"support vector clustering"'
Search Results
2. Data-driven contextual robust optimization based on support vector clustering
- Author
-
Li, Xianyu, Dong, Fenglian, Wei, Zhiwei, and Shang, Chao
- Published
- 2025
- Full Text
- View/download PDF
3. Artificial intelligence-based approach for islanding detection in cyber-physical power systems
- Author
-
Golpîra, Hêmin and Francois, Bruno
- Published
- 2024
- Full Text
- View/download PDF
4. A new data-driven robust optimization method for sustainable waste-to-energy supply chain network design problem
- Author
-
Liu, Naiqi, Tang, Wansheng, Chen, Aixia, and Lan, Yanfei
- Published
- 2025
- Full Text
- View/download PDF
5. Exploring Kernel Machines and Support Vector Machines: Principles, Techniques, and Future Directions.
- Author
-
Du, Ke-Lin, Jiang, Bingchun, Lu, Jiabin, Hua, Jingyu, and Swamy, M. N. S.
- Subjects
- *
COMPUTATIONAL learning theory , *STATISTICAL learning , *SUPPORT vector machines , *RADIAL basis functions , *KERNEL functions - Abstract
The kernel method is a tool that converts data to a kernel space where operation can be performed. When converted to a high-dimensional feature space by using kernel functions, the data samples are more likely to be linearly separable. Traditional machine learning methods can be extended to the kernel space, such as the radial basis function (RBF) network. As a kernel-based method, support vector machine (SVM) is one of the most popular nonparametric classification methods, and is optimal in terms of computational learning theory. Based on statistical learning theory and the maximum margin principle, SVM attempts to determine an optimal hyperplane by addressing a quadratic programming (QP) problem. Using Vapnik–Chervonenkis dimension theory, SVM maximizes generalization performance by finding the widest classification margin within the feature space. In this paper, kernel machines and SVMs are systematically introduced. We first describe how to turn classical methods into kernel machines, and then give a literature review of existing kernel machines. We then introduce the SVM model, its principles, and various SVM training methods for classification, clustering, and regression. Related topics, including optimizing model architecture, are also discussed. We conclude by outlining future directions for kernel machines and SVMs. This article functions both as a state-of-the-art survey and a tutorial. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. Scalable decision fusion algorithm for enabling decentralized computation in distributed, big data clustering problems.
- Author
-
Jennath, H. S. and Asharaf, S.
- Abstract
In the world of big data, extracting meaningful insights from large and continually growing distributed datasets is a major challenge. Classical clustering algorithms are effective at identifying clusters with convex structures. However, they fall short in identifying arbitrary-shaped clusters (more irregular and complex patterns), which are often encountered in real-world applications. The process of identifying non-convex cluster representations from very large and growing datasets is a challenge. It is further compounded by the distributed nature of the data, necessitating complex computations across multiple devices. Support Vector Clustering (SVC) is a much-celebrated algorithm capable of finding arbitrarily shaped clusters. However, the major limitation of this algorithm is that it will not scale to large volumes of data as the time and space complexity is high. The second limitation of the SVC algorithm is the requirement for large computation time in finding cluster structures. The adoption of a coreset based methodology is required for finding the true representation of the underlying large datasets. The implementation of hierarchical clustering on these distributed coresets, unlocks the potential to uncover a structured hierarchy of abstractions across the disseminated data. Moreover, a distance-based clustering approach guarantees the identification of clusters with diverse and arbitrary shapes, providing a robust framework for detecting complex structures. This research utilizes the Core Vector Machine (CVM) approach using an approximate Minimum Enclosing Ball (MEB) algorithm to efficiently address the complexities inherent in traditional SVC. Additionally, an enhanced medoid algorithm is employed for cluster head identification across the data sources. Hierarchical clustering is performed in the Reproducing Kernel Hilbert Space (RKHS) using cosine similarity distance matrices. This is used to identify compact non-convex clusters within distributed datasets. Performance assessment involves benchmarking our approach against state-of-the-art improved SVC algorithms using large datasets. The outcomes validate the superior performance of our approach compared to existing methods. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. Robust LNG sales planning under demand uncertainty: A data-driven goal-oriented approach
- Author
-
Yulin Feng, Xianyu Li, Dingzhi Liu, and Chao Shang
- Subjects
Robust optimization ,Uncertainty set ,Data-driven decision-making ,Support vector clustering ,LNG sales planning ,Mixed-integer linear programming ,Chemical engineering ,TP155-156 ,Information technology ,T58.5-58.64 - Abstract
This paper addresses the liquefied natural gas (LNG) sales planning problem over a pipeline network with a focus on uncertain demands. Generically, the total profit is maximized by seeking optimal transportation and inventory decisions, and robust optimization (RO) has been a viable decision-making strategy to this end, which is however known to suffer from over-conservatism. To circumvent this, a new goal-oriented data-driven RO approach is proposed. First, we adopt data-driven polytopic uncertainty sets based on kernel learning, which yields a compact high-density region from data and assures tractability of RO problems. Based on this, a new goal-oriented RO formulation is put forward to satisfy to the greatest extent the target profit while tolerating slight constraint violations. In contrast to traditional min–max RO scheme, the proposed scheme not only ensures a flexible trade-off but also yields parameters with clear interpretation. The resulting optimization problem turns out to be equivalent to a mixed-integer linear program that can be effectively handled using off-the-shelf solvers. We illustrate the merit of the proposed method in satisfying a prescribed goal with optimized robustness by means of a case study.
- Published
- 2023
- Full Text
- View/download PDF
8. 基于支持向量聚类和模糊粗糙集的交通流数据修复方法.
- Author
-
朱世超, 王骋程, 王超, 刘隆, 张润芝, and 王浩
- Subjects
- *
ROUGH sets , *TRAFFIC flow , *FUZZY sets , *FUZZY neural networks , *MISSING data (Statistics) , *GENETIC algorithms - Abstract
In order to solve the problems of missing traffic flow data caused by various reasons such as weather effect, detector faults and artificial error etc., this paper proposed a method based on the fuzzy rough set theory to impute missing traffic flow data. We combined the support vector clustering and fuzzy rough set to classify traffic flow data, and then combined the fuzzy neural network and genetic algorithm to impute missing data. The method optimized the support vector clustering parameters, cluster size and weighting factor, and estimated the missing values. The results of the study showed that the proposed novel hybrid method produced sufficient and reasonable data imputation performance results. Compared with the results of fuzzy neural network and other estimation models, the data imputation effect of this model was better than other comparison models. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
9. Data-driven robust optimization for pipeline scheduling under flow rate uncertainty.
- Author
-
Baghban, Amir, Castro, Pedro M., and Oliveira, Fabricio
- Subjects
- *
ROBUST optimization , *LINEAR programming , *ROBUST programming , *WAREHOUSES , *TRANSPORTATION schedules - Abstract
• Efficient deterministic continuous-time MILP model for scheduling the transportation of oil derivatives via pipeline. • Smaller-sized model and faster computational time compared to previous models. • Scheduling under uncertainty using data-driven robust optimization. • Application of support vector clustering in forming the uncertainty set. • Easy and meaningful trade-off between robustness and optimality. Frequently, parameters in optimization models are subject to a high level of uncertainty coming from several sources and, as such, assuming them to be deterministic can lead to solutions that are infeasible in practice. Robust optimization is a computationally efficient approach that generates solutions that are feasible for realizations of uncertain parameters near the nominal value. This paper develops a data-driven robust optimization approach for the scheduling of a straight pipeline connecting a single refinery with multiple distribution centers, considering uncertainty in the injection rate. For that, we apply support vector clustering to learn an uncertainty set for the robust version of the deterministic model. We compare the performance of our proposed robust model against one utilizing a standard robust optimization approach and conclude that data-driven robust solutions are less conservative. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
10. Rough-Fuzzy Support Vector Clustering with OWA Operators.
- Author
-
Saltos, Ramiro and Weber, Richard
- Subjects
- *
SOFT computing , *FUZZY sets , *ALGORITHMS , *DATA mining - Abstract
Rough-Fuzzy Support Vector Clustering (RFSVC) is a novel soft computing derivative of the classical Support Vector Clustering (SVC) algorithm used successfully in many real-world applications. The strengths of RFSVC are its ability to handle arbitrary cluster shapes, identify the number of clusters, and effectively detect outliers by using the membership degrees. However, its current version uses only the closest support vector of each cluster to calculate outliers’ membership degrees, neglecting important information that remaining support vectors can contribute. We present a novel approach based on the ordered weighted average (OWA) operator that aggregates information from all cluster representatives when computing final membership degrees and, at the same time, allows a better interpretation of the cluster structures found. Particularly, we propose the OWA using weights computed by the linguistic and exponential quantifiers. The computational experiments show that our approach obtains comparable results with the current version of RFSVC. However, the former weights all clusters’ support vectors in the computation of membership degrees while maintaining their interpretability level for detecting outliers. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
11. Research and Experiment of Radar Signal Support Vector Clustering Sorting Based on Feature Extraction and Feature Selection
- Author
-
Shiqiang Wang, Caiyun Gao, Qin Zhang, Veerendra Dakulagi, Huiyong Zeng, Guimei Zheng, Juan Bai, Yuwei Song, Jiliang Cai, and Binfeng Zong
- Subjects
Feature extraction ,feature selection ,feature set ,support vector machine ,support vector clustering ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
The result of radar signal sorting directly affects the performance of electronic reconnaissance equipment. Sorting method based on intra-pulse features has become a research focus in recent years. However, as the number of extracted features increases, the dimension of the feature vector becomes higher and higher. And too many dimensional feature vectors would make the complexity of the sorting algorithm increase geometrically. In this way, feature selection becomes more and more necessary. Combining the latest research on fuzzy rough sets, this paper proposes two feature selection methods, namely two-steps attribute reduction based on fuzzy dependency (TARFD) algorithm and fuzzy rough artificial bee colony (FRABC) algorithm. The TARFD method uses the candidate attribute set as starting point, according to the definition of the redundant attribute set. Then the less important attributes are successively eliminated. The FRABC method starts from the dependence degree of fuzzy rough set, and constructs a fitness function that reflects the importance of the attribute subset and the reduction rate. Based on this function, the artificial bee colony algorithm is used to reduce the attributes of the dataset. Using the TARFD and FRABC algorithms, the extracted feature sets, including entropy feature set, Zernike moment feature set, pseudo Zernike feature set, gray level co-occurrence matrix (GLCM) feature set, and Hu-invariant moment feature set are processed, then an optimal feature subset was obtained and a sorting test was performed. The results show the effectiveness of the extracted intra-pulse features and the efficiency of the feature selection algorithm.
- Published
- 2020
- Full Text
- View/download PDF
12. Identification of Coherent Generators by Support Vector Clustering With an Embedding Strategy
- Author
-
Mehdi Babaei, S. M. Muyeen, and Syed Islam
- Subjects
Coherent generators ,dynamic coupling ,embedding ,slow coherency ,support vector clustering ,synchrophasor ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
Identification of coherent generators (CGs) is necessary for the area-based monitoring and protection system of a wide area power system. Synchrophasor has enabled smarter monitoring and control measures to be devised; hence, measurement-based methodologies can be implemented in online applications to identify the CGs. This paper presents a new framework for coherency identification that is based on the dynamic coupling of generators. A distance matrix that contains the dissimilarity indices between any pair of generators is constructed from the pairwise dynamic coupling of generators after the post-disturbance data are obtained by phasor measurement units (PMUs). The dataset is embedded in Euclidean space to produce a new dataset with a metric distance between the points, and then the support vector clustering (SVC) technique is applied to the embedded dataset to identify the final clusters of generators. Unlike other clustering methods that need a priori knowledge about the number of clusters or the parameters of clustering, this information is set in an automatic search procedure that results in the optimal number of clusters. The algorithm is verified by time-domain simulations of defined scenarios in 39 bus and 118 bus test systems. Finally, the clustering result of 39 bus systems is validated by cluster validity measures, and a comparative study investigates the efficacy of the proposed algorithm to cluster the generators with an optimal number of clusters and also its computational efficiency compared with other clustering methods.
- Published
- 2019
- Full Text
- View/download PDF
13. Efficient Training Support Vector Clustering With Appropriate Boundary Information
- Author
-
Yuan Ping, Bin Hao, Huina Li, Yuping Lai, Chun Guo, Hui Ma, Baocang Wang, and Xiali Hei
- Subjects
Support vector clustering ,shrinkable boundary selection ,dual coordinate descent ,traffic analysis ,intrusion detection ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
Due to the remarkable capability in handling arbitrary cluster shapes, support vector clustering (SVC) benefits data analysis in terms of data description. However, large-scale data such as network traffic frequently makes it suffer from highly intensive pricey computation and storage for solving the dual problem and storing the kernel matrix, respectively. Fortunately, support vectors which describe the clusters, in a sense, are expected in the boundaries. To tackle this issue, we propose an efficient training SVC with appropriate boundary information (ETSVC), which features excellent flexibility and scalability. In ETSVC, we first give a shrinkable boundary selection (SBS) method which collects appropriate boundaries while reducing redundancy and noise. Based on the boundary information, a redefined dual problem is then designed without scarifying the principle of SVC. Finally, we design a reformative solver (RSolver) to reformulate the training phase, which estimates the support vector function by solving the dual problem. Since only a subset of boundaries is employed for model training, theoretical analysis suggests that ETSVC reaches efficiency improvement and consumes much less memory if sacrificing efficiency to reduce storage consumption. Towards grouping P2P flows and large-scale intrusion traffic, as well as other non-traffic data, experimental results confirm that ETSVC could be applied to resources constrained platform while achieving comparable accuracies with the state-of-the-art methods.
- Published
- 2019
- Full Text
- View/download PDF
14. EXPLORING EFFICIENT KERNEL FUNCTIONS FOR SUPPORT VECTOR CLUSTERING.
- Author
-
BAĞCI, Furkan Burak and KARAL, Ömer
- Subjects
VECTOR valued functions ,FUZZY algorithms ,K-means clustering ,DATA analysis - Abstract
Copyright of Mugla Journal of Science & Technology is the property of Mugla Journal of Science & Technology and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2020
- Full Text
- View/download PDF
15. Robust multi-product inventory optimization under support vector clustering-based data-driven demand uncertainty set.
- Author
-
Qiu, Ruozhen, Sun, Yue, Fan, Zhi-Ping, and Sun, Minghe
- Subjects
- *
ROBUST optimization , *LINEAR programming , *UNCERTAINTY , *INVENTORIES - Abstract
A robust multi-product inventory optimization approach is developed with an uncertainty set constructed from the available data using support vector clustering (SVC). The multi-product inventory problem is subject to demand uncertainties in a newsvendor setting with the historical demand data as the only available information. By using SVC, the uncertainty set to which the uncertain demands belong is constructed with a certain confidence in a data-driven approach. The associated robust counterpart model is then developed using the absolute robustness criterion. Through mathematical deduction, the proposed counterpart model is transformed into a tractable linear programming model which can be solved efficiently. The transformed and the original models are proved to be mathematically equivalent. Numerical studies are conducted to illustrate the effectiveness and practicality of the proposed SVC-based data-driven robust optimization approach for dealing with demand uncertainties. The results show that the robust optimization approach under the proposed SVC-based uncertainty set outperforms those under the traditional, i.e., the box and the ellipsoid, uncertainty sets. These results provide evidences that the proposed data-driven robust optimization approach can better hedge against demand uncertainties in multi-product inventory problems. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
16. Feedback-driven real-time forecasting method for the arrival times of electric vehicles.
- Author
-
Wu, Chuanshen, Han, Haiteng, and Gao, Shan
- Subjects
- *
ELECTRIC vehicles , *PROBABILITY density function , *FORECASTING , *TRAFFIC violations , *WEATHER - Abstract
• A forecasting method for the arrival times of electric vehicles is proposed. • An optimal parameter modification model is established based on the feedback. • Historical data is used to obtain the optimization range of optimal parameter values. • The optimization range is dynamically adjusted considering the robustness. Affected by weather conditions, traffic conditions, and driver behavior, the arrival characteristics of electric vehicles (EVs) vary significantly from day to day. This study proposes a feedback-driven real-time forecasting approach that combines historical data to improve the forecasting accuracy of arrival times of EVs. For model-based forecasting methods that sample from probability density functions (PDFs), the related parameter values are dynamically optimized. Compared with sampling from PDFs with empirical parameter values, the dynamic optimal parameter values can track the characteristics of EV arrivals by fully using the continuously updated EV feedback. Considering robustness, a historical data-based support vector clustering technology is utilized to obtain the optimization range of optimal parameter values. As a key of this study, the conservativeness of the optimization range is dynamically adjusted with the periodic updates of EV feedback. The experimental results indicate that, by making full utilization of EV feedback, the proposed method can effectively reduce the forecasting errors of EV arrival times. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
17. Minimum Distribution Support Vector Clustering
- Author
-
Yan Wang, Jiali Chen, Xuping Xie, Sen Yang, Wei Pang, Lan Huang, Shuangquan Zhang, and Shishun Zhao
- Subjects
support vector clustering ,margin theory ,mean ,variance ,dual coordinate descent ,Science ,Astrophysics ,QB460-466 ,Physics ,QC1-999 - Abstract
Support vector clustering (SVC) is a boundary-based algorithm, which has several advantages over other clustering methods, including identifying clusters of arbitrary shapes and numbers. Leveraged by the high generalization ability of the large margin distribution machine (LDM) and the optimal margin distribution clustering (ODMC), we propose a new clustering method: minimum distribution for support vector clustering (MDSVC), for improving the robustness of boundary point recognition, which characterizes the optimal hypersphere by the first-order and second-order statistics and tries to minimize the mean and variance simultaneously. In addition, we further prove, theoretically, that our algorithm can obtain better generalization performance. Some instructive insights for adjusting the number of support vector points are gained. For the optimization problem of MDSVC, we propose a double coordinate descent algorithm for small and medium samples. The experimental results on both artificial and real datasets indicate that our MDSVC has a significant improvement in generalization performance compared to SVC.
- Published
- 2021
- Full Text
- View/download PDF
18. Study on phosphor powder precipitation model in flexible material manufacturing process based on neuro-fuzzy network.
- Author
-
Deng, Yaohua, Lu, Qiwen, Yao, Kexing, and Zhou, Na
- Subjects
- *
SUPPORT vector machines , *SILICA gel , *MACHINE learning , *PHOSPHORS , *PREDICTION models - Abstract
The precipitation of LED phosphor glue is not only related to the physical properties of phosphor powder and silica gel, but also influenced by the uncertainties in the production process. In this paper, support vector clustering (SVC) is combined with T-S neuro-fuzzy network to build the neuro-fuzzy network prediction model of phosphor powder precipitation. The structure identification of the predictive model and the neuro-fuzzy network parameter learning algorithm are derived. Finally, the flow chart of the modeling of predictive model is given. The test results show that the training time of the new TSFNN proposed in this paper is 56% less than the standard TSFNN model and the average error of the new TSFNN is 33.33% less than the standard one. LED phosphor powder mixing system test shows that the new TSFNN model control system effectively enhances the LED light color consistency comparing with the traditional method. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
19. Community detection in complex networks using proximate support vector clustering.
- Author
-
Feifan Wang, Baihai Zhang, Senchun Chai, and Yuanqing Xia
- Subjects
- *
HIERARCHICAL clustering (Cluster analysis) , *CLUSTER analysis (Statistics) , *SUPPORT vector machines , *COMPUTER networks , *ALGORITHMS - Abstract
Community structure, one of the most attention attracting properties in complex net-works, has been a cornerstone in advances of various scientific branches. A number of tools have been involved in recent studies concentrating on the community detection algorithms. In this paper, we propose a support vector clustering method based on a proximity graph, owing to which the introduced algorithm surpasses the traditional support vector approach both in accuracy and complexity. Results of extensive experiments undertaken on computer generated networks and real world data sets illustrate competent performances in comparison with the other counterparts. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
20. Data-driven robust portfolio optimization with semi mean absolute deviation via support vector clustering.
- Author
-
Sehgal, Ruchika and Jagadesh, Pattem
- Subjects
- *
PORTFOLIO management (Investments) , *ROBUST optimization , *DOW Jones industrial average , *LINEAR programming , *ROBUST control , *VALUE at risk - Abstract
The portfolio optimization (PO) model with semi-mean absolute deviation (SMAD) risk measure has been commonly applied to construct optimal portfolios due to the ease of solving the corresponding linear programming model. We propose a robust PO model with SMAD that considers the uncertainty associated with asset expected returns. This uncertainty is dealt by adopting a data-driven approach that captures the uncertain asset returns in a convex uncertainty set through support vector clustering. The proposed model involves solving a quadratic programming problem to identify support vectors and a robust linear PO model. The ability of the proposed technique to control the conservatism and the computational ease associated with it makes it a practical approach to yield robust optimal portfolios. The effectiveness of the model is demonstrated by constructing optimal portfolios with the constituents of four global market indices, namely Dow Jones Industrial Average (USA), DAX 30 (Germany), Nifty 50 (India), and EURO STOXX 50 (Europe). The out-of-sample statistics generated from the robust portfolios are compared with the optimal portfolios obtained from its nominal counterpart, naive 1 / n strategy, and other robust technique available in the literature. We find that the proposed model consistently performs well in most data sets over several performance measures like average returns, risk measured by standard deviation, value at risk, conditional value at risk and various reward-risk ratios. Comparative analysis of these models in different market phases of EURO STOXX 50 demonstrates the effectiveness of the developed robust technique, especially during the bearish phase. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
21. A Coherency Identification Method of Active Frequency Response Control Based on Support Vector Clustering for Bulk Power System
- Author
-
Cuicui Jin, Weidong Li, Liu Liu, Ping Li, and Xian Wu
- Subjects
active frequency response ,bulk power system ,support vector clustering ,wide area measurement system ,frequency dynamic curve ,primary frequency control ,Technology - Abstract
Active frequency response (AFR) control is needed in current power systems. To solve the over-frequency problems of generators connected to non-disturbed buses during the AFR control period, the generators should be clustered into coherent groups. Thus, the control efficiency can be improved on the premise of ensuring control accuracy. Since the influencing factors (such as the model parameters, operation modes, and disturbance locations, etc.) of power system operation can be comprehensively reflected by the generator frequency, which is easily collected and calculated, the generator frequency can be used as the coherency identification input. In this paper, we propose a coherency identification method of AFR control based on support vector clustering for a bulk power system. By mapping data samples from the initial space to the high-dimensional feature space, the radius of the minimal enclosing sphere that can envelop all the data samples is obtained. Moreover, the coherency identification of generators is determined for AFR control according to the evaluating method of AFR clustering control effects and the evaluating index of cluster compactness and separation. The simulation results for the modified New England IEEE 10-generator 39-bus system and Henan power grid show that the proposed method is feasible and effective.
- Published
- 2019
- Full Text
- View/download PDF
22. Technology forecasting using matrix map and patent clustering
- Author
-
Jun, Sunghae, Sung Park, Sang, and Sik Jang, Dong
- Published
- 2012
- Full Text
- View/download PDF
23. Data-driven robust optimization based on kernel learning.
- Author
-
Shang, Chao, Huang, Xiaolin, and You, Fengqi
- Subjects
- *
KERNEL operating systems , *QUADRATIC programming , *DATA analysis , *COMPUTATIONAL complexity , *MATHEMATICAL optimization - Abstract
We propose piecewise linear kernel-based support vector clustering (SVC) as a new approach tailored to data-driven robust optimization. By solving a quadratic program, the distributional geometry of massive uncertain data can be effectively captured as a compact convex uncertainty set, which considerably reduces conservatism of robust optimization problems. The induced robust counterpart problem retains the same type as the deterministic problem, which provides significant computational benefits. In addition, by exploiting statistical properties of SVC, the fraction of data coverage of the data-driven uncertainty set can be easily selected by adjusting only one parameter, which furnishes an interpretable and pragmatic way to control conservatism and exclude outliers. Numerical studies and an industrial application of process network planning demonstrate that, the proposed data-driven approach can effectively utilize useful information with massive data, and better hedge against uncertainties and yield less conservative solutions. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
24. FRSVC: Towards making support vector clustering consume less.
- Author
-
Ping, Yuan, Tian, Yingjie, Guo, Chun, Wang, Baocang, and Yang, Yuehua
- Subjects
- *
SUPPORT vector machines , *DOCUMENT clustering , *PROBLEM solving , *KERNEL (Mathematics) , *NEAREST neighbor analysis (Statistics) - Abstract
In spite of with great advantage of discovering arbitrary shapes of clusters, support vector clustering (SVC) is frustrated by large-scale data, especially on resource limited platform. It is due to pricey storage and computation consumptions from solving dual problem and labeling clusters upon the pre-computed kernel matrix and sampling point pairs, respectively. Towards on it, we first present a dual coordinate descent method to reformulate the solver that leads to a flexible training phase carried out on any runtime platform with/without sufficient memory. Then, a novel labeling phase who does connectivity analysis between two nearest neighboring decomposed convex hulls referring to clusters is proposed, in which a new designed strategy namely sample once connected checking first tries to reduces the scope of sampling analysis. By integrating them together, a faster and reformulated SVC (FRSVC) is created with less consumption achieved according to comparative analysis of time and space complexities. Furthermore, experimental results confirm a significant improvement on flexibility of selective efficiency without losing accuracy, with which a balance can be easily reached on the basis of resources a platform equipped. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
25. Fast support vector clustering.
- Author
-
Pham, Tung, Dang, Hang, Le, Trung, and Le, Thai
- Subjects
CLUSTER analysis (Statistics) ,OUTLIER detection ,SUPPORT vector machines ,KERNEL (Mathematics) ,STOCHASTIC analysis - Abstract
Support-based clustering has recently absorbed plenty of attention because of its applications in solving the difficult and diverse clustering or outlier detection problem. Support-based clustering method perambulates two phases: finding the domain of novelty and performing the clustering assignment. To find the domain of novelty, the training time given by the current solvers is typically over-quadratic in the training size. This fact impedes the application of support-based clustering method to the large-scale datasets. In this paper, we propose applying stochastic gradient descent framework to the first phase of support-based clustering for finding the domain of novelty in the form of a half-space and a new strategy to perform the clustering assignment. We validate our proposed method on several well-known datasets for clustering task to show that the proposed method renders a comparable clustering quality to the baselines while being faster than them. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
26. Privacy‐preserving evaluation for support vector clustering
- Author
-
Jaewook Lee, Saerom Park, and Junyoung Byun
- Subjects
Privacy preserving ,Computer science ,Support vector clustering ,Data mining ,Electrical engineering. Electronics. Nuclear engineering ,Electrical and Electronic Engineering ,computer.software_genre ,computer ,Computer Science::Cryptography and Security ,TK1-9971 - Abstract
The authors proposed a privacy‐preserving evaluation algorithm for support vector clustering with a fully homomorphic encryption. The proposed method assigns clustering labels to encrypted test data with an encrypted support function. This method inherits the advantageous properties of support vector clustering, which is naturally inductive to cluster new test data from complex distributions. The authors efficiently implemented the proposed method with elaborate packing of the plaintexts and avoiding non‐polynomial operations that are not friendly to homomorphic encryption. These experimental results showed that the proposed model is effective in terms of clustering performance and has robustness against the error that occurs from homomorphic evaluation and approximate operations.
- Published
- 2021
27. DESTEK VEKTÖR ÖBEKLEME İÇİN ETKİLİ KERNEL FONKSİYONLARININ ARAŞTIRILMASI
- Author
-
Furkan Burak Bağci and Omer Karal
- Subjects
Fuel Technology ,Computer science ,business.industry ,Energy Engineering and Power Technology ,Support vector clustering ,Pattern recognition ,Artificial intelligence ,business - Abstract
Clustering is an effective tool that divides data into different classes to reveal internal and previously unknown data schemes. However, in conventional clustering algorithms such as the k-means, k-NN, fuzzy c tool, the selection of the appropriate number of clusters for each data set is uncertain and varies with the data sets. Furthermore, the data sets to which the clustering algorithm is applied generally have nonlinear boundaries between clusters. Determining these nonlinear boundaries in the input space causes a complex problem. To overcome these problems, kernel-based clustering methods have been developed in recent years, which automatically determine the number and boundaries of clusters. In particular, the Support Vector Clustering (SVC) algorithm has received great attention in data analysis because of its features such as automatically determining the number of clusters and recognizing nonlinear boundaries based on the Gaussian kernel parameter. The number of clusters and region boundaries produced by SVC may show variation depending on the choice of the kernel function and its parameters. Therefore, the choice of kernel function plays a significant role. In this study, for the first time, the implementation of two different kernel (Cauchy and Laplacian) functions and evaluation of their performances have been realized within the framework of SVC. It was observed that the Laplacian kernel function performed better than Gauss and Cauchy kernel functions.
- Published
- 2020
- Full Text
- View/download PDF
28. Online Anomaly Detection Based on Support Vector Clustering
- Author
-
Mohammad Amin Adibi and Jamal Shahrabi
- Subjects
Online anomaly detection ,support vector clustering ,self-organizing map ,quadratic programming ,Electronic computers. Computer science ,QA75.5-76.95 - Abstract
A two-phase online anomaly detection method based on support vector clustering (SVC) in the presence of non-stationary data is developed in this paper which permits arbitrary-shaped data clusters to be precisely treated. In the first step, offline learning is performed to achieve an appropriate detection model. Then the current model dynamically evolves to match the rapidly changing real-world data. To reduce the dimension of the quadratic programming (QP) problem emerging in the SVC, self-organizing map (SOM) and a replacement mechanism are used to summarize the incoming data. Thus, the proposed method can be efficiently and effectively useable in real time applications. The performance of the proposed method is evaluated by a simulated dataset, three subsets extracted from the KDD Cup 99 dataset, and the keystroke dynamics dataset. Results illustrate capabilities of the proposed method in detection of new attacks as well as normal pattern changes over the time.
- Published
- 2015
- Full Text
- View/download PDF
29. Dynamic equivalencing of an active distribution network for large‐scale power system frequency stability studies.
- Author
-
Golpîra, Hêmin, Seifi, Hossein, and Haghifam, Mahmoud Reza
- Abstract
This study presents an approach for developing the dynamic equivalent model of an active distribution network (ADN), consisting of several micro‐grids, for frequency stability studies. The proposed grey‐box equivalent model relies on Prony analysis to establish stop time and load damping as the required modelling parameters. Support vector clustering (SVC) and grouping procedure are employed for aggregation and order‐reduction of ADN. This significantly decreases the sensitivity of the estimated parameters to operating point changes which, in turn, guarantees the model robustness. This is done through representing the SVC output, that is, clusters, by cluster substitutes. The final ADN dynamic equivalent model is represented by several groups, in which their mutual interactions are taken into account by a new developed mathematical‐based criterion. Simulation results reveal that the proposed model is robust which could successfully take into account the continuous and discontinuous uncertainties. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
30. Enhanced just-in-time modelling for online quality prediction in BF ironmaking.
- Author
-
Liu, Y. and Gao, Z.
- Subjects
- *
JUST-in-time systems , *BLAST furnaces , *IRON industry , *GAUSSIAN distribution , *SUPPORT vector machines - Abstract
Various data driven soft sensor models have been established for online prediction of the silicon content in blast furnace ironmaking processes. However, two main disadvantages still remain in these empirical models. First, most of traditional outlier detection methods for preprocessing the data samples assume that they (approximately) follow a Gaussian distribution and thus may be invalid for some situations. To address this problem, a support vector clustering (SVC) based efficient outlier detection method is proposed whereby the process nonlinearity and non-Gaussianity can be better handled. Second, only using a single global model is insufficient to capture all the process characteristics, especially for those complicated regions. In this paper, a reliable just-in-time modelling method is proposed. The SVC outlier detection is integrated into the just-in-time-based local modelling method to enhance the reliability of quality prediction. A healthier relevant data set is constructed to build a more reliable local prediction model. Moreover, the historical data set is updated repetitively in a reasonable way. The superiority of the proposed method is demonstrated and compared with other soft sensors in terms of online prediction of the silicon content in an industrial blast furnace in China. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
31. Fast and scalable support vector clustering for large-scale data analysis.
- Author
-
Ping, Yuan, Chang, Yun, Zhou, Yajian, Tian, Ying, Yang, Yi, and Zhang, Zhili
- Subjects
SUPPORT vector machines ,DOCUMENT clustering ,MATHEMATICAL decomposition ,ALGORITHMS ,DATA analysis - Abstract
As an important boundary-based clustering algorithm, support vector clustering (SVC) benefits multiple applications for its capability of handling arbitrary cluster shapes. However, its popularity is degraded by both its highly intensive pricey computation and poor label performance which are due to redundant kernel function matrix required by estimating a support function and ineffectively checking segmers between all pairs of data points, respectively. To address these two problems, a fast and scalable SVC (FSSVC) method is proposed in this paper to achieve significant improvement on efficiency while guarantees a comparable accuracy with the state-of-the-art methods. The heart of our approach includes (1) constructing the hypersphere and support function by cluster boundaries which prunes unnecessary computation and storage of kernel functions and (2) presenting an adaptive labeling strategy which decomposes clusters into convex hulls and then employs a convex-decomposition-based cluster labeling algorithm or cone cluster labeling algorithm on the basis of whether the radius of the hypersphere is greater than 1. Both theoretical analysis and experimental results (e.g., the first rank of a nonparametric statistical test) show the superiority of our method over the others, especially for large-scale data analysis under limited memory requirements. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
32. Cluster validity measure and merging system for hierarchical clustering considering outliers.
- Author
-
de Morsier, Frank, Tuia, Devis, Borgeaud, Maurice, Gass, Volker, and Thiran, Jean-Philippe
- Subjects
- *
HIERARCHICAL clustering (Cluster analysis) , *MEASURE theory , *OUTLIERS (Statistics) , *ALGORITHMS , *GAUSSIAN processes , *SUPPORT vector machines - Abstract
Clustering algorithms have evolved to handle more and more complex structures. However, the measures that allow to qualify the quality of such clustering partitions are rare and have been developed only for specific algorithms. In this work, we propose a new cluster validity measure (CVM) to quantify the clustering performance of hierarchical algorithms that handle overlapping clusters of any shape and in the presence of outliers. This work also introduces a cluster merging system (CMS) to group clusters that share outliers. When located in regions of cluster overlap, these outliers may be issued by a mixture of nearby cores. The proposed CVM and CMS are applied to hierarchical extensions of the Support Vector and Gaussian Process Clustering algorithms both in synthetic and real experiments. These results show that the proposed metrics help to select the appropriate level of hierarchy and the appropriate hyperparameters. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
33. Recent Advances in Support Vector Clustering: Theory and Applications.
- Author
-
Li, Huina and Ping, Yuan
- Subjects
- *
SUPPORT vector machines , *PARAMETER estimation , *DESCRIPTIVE statistics , *PATTERN recognition systems , *MATHEMATICAL optimization - Abstract
As an important boundary-based clustering algorithm, support vector clustering (SVC) can benefit many real applications owing to its capability of handling arbitrary cluster shapes, especially those directly or indirectly related to pattern exploration and description. As the application deepens, the importance of performance (i.e. criterions of accuracy and efficiency) of SVC increases. To identify gaps in the current methods and propose novel research directions for SVC, we present a survey of the literature in this area. Our approach is to classify the most recent advances into either theory or application. For theoretical contributions, advances related to parameter selection and optimization, dual-problem solutions, and cluster labeling are introduced. We also simultaneously summarize the advantages and drawbacks of each study. With respect to applications, we clearly describe eight groups of schemes based on SVC, either as individual or hybrid methods. Finally, we identify the gaps in SVC research and suggest several future research issues and trends. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
34. FACE RECOGNITION BASED ON IMPROVED SUPPORT VECTOR CLUSTERING.
- Author
-
Yongqing Wang and Xiling Liu
- Subjects
HUMAN facial recognition software ,SUPPORT vector machines ,GENETIC algorithms ,BIOMETRIC identification ,KERNEL operating systems - Abstract
Traditional methods for face recognition do not scale well with the number of training sample, which limits the wide applications of related techniques. We propose an improved Support Vector Clustering algorithm to handle the large-scale biometric feature data effectively. We prove theoretically that the proposed algorithm converges to the optimum within any given precision quickly. Compared to related state-of-the-art Support Vector Clustering algorithms, it has the competitive performances on both training time and accuracy. Besides, we use the proposed algorithm to handle classification problem, and face recognition, as well. Experiments on synthetic and real-world data sets demonstrate the validity of the proposed algorithm. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
35. A tensor-based deep learning framework.
- Author
-
Charalampous, Konstantinos and Gasteratos, Antonios
- Subjects
- *
SUPERVISED learning , *CALCULUS of tensors , *MACHINE learning , *HUMAN-robot interaction , *SPATIOTEMPORAL processes , *TENSOR algebra - Abstract
This paper presents an unsupervised deep learning framework that derives spatio-temporal features for human–robot interaction. The respective models extract high-level features from low-level ones through a hierarchical network, viz. the Hierarchical Temporal Memory (HTM), providing at the same time a solution to the curse of dimensionality in shallow techniques. The presented work incorporates the tensor-based framework within the operation of the nodes and, thus, enhances the feature derivation procedure. This is due to the fact that tensors allow the preservation of the initial data format and their respective correlation and, moreover, attain more compact representations. The computational nodes form spatial and temporal groups by exploiting the multilinear algebra and subsequently express the samples according to those groups in terms of proximity. This generic framework may be applied in a diverse of visual data, while it has been examined on sequences of color and depth images, exhibiting remarkable performance. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
36. Support vector clustering of time series data with alignment kernels.
- Author
-
Boecking, Benedikt, Chalup, Stephan K., Seese, Detlef, and Wong, Aaron S.W.
- Subjects
- *
SUPPORT vector machines , *TIME series analysis , *DATA analysis , *KERNEL (Mathematics) , *CLUSTER analysis (Statistics) - Abstract
Highlights: [•] We cluster time series with a support vector clustering algorithm in a novel approach by using a Triangular Alignment Kernel. [•] We thus cluster time series without determining the number of clusters or their shape in advance. [•] We show that the quality of our results is competitive with other clustering approaches using the same dataset. [Copyright &y& Elsevier]
- Published
- 2014
- Full Text
- View/download PDF
37. Identification of temporal variations in mental workload using locally-linear-embedding-based EEG feature reduction and support-vector-machine-based clustering and classification techniques.
- Author
-
Yin, Zhong and Zhang, Jianhua
- Subjects
- *
PSYCHOLOGICAL stress , *ELECTROENCEPHALOGRAPHY , *SUPPORT vector machines , *CLUSTER analysis (Statistics) , *RECOGNITION (Psychology) , *HUMAN-machine systems - Abstract
Highlights: [•] Recognition of mental workload is crucial to improve the safety of human–machine system. [•] A simulated process control task was used to elicit different levels of mental workload. [•] Mental workload was assessed by using multichannel EEG signals. [•] Locally linear embedding was used to reduce the dimensionality of EEG power features. [•] Support vector clustering and support vector data description methods were combined to classify EEG data. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
38. Multiple-Parameter Radar Signal Sorting Using Support Vector Clustering and Similitude Entropy Index.
- Author
-
Wang, Zhanling, Zhang, Dengfu, Bi, Duyan, and Wang, Shiqiang
- Subjects
- *
RADAR signal processing , *SORTING (Electronic computers) , *INFORMATION theory , *ELECTRONIC intelligence , *PARAMETER estimation , *SUPPORT vector machines - Abstract
The radar signal sorting method based on traditional support vector clustering (SVC) algorithm takes a high time complexity, and the traditional validity index cannot efficiently indicate the best sorting result. Aiming at solving the problem, we study a new sorting method based on cone cluster labeling (CCL) method. The CCL method relies on the theory of approximate coverings both in feature space and data space. Also a new cluster validity index, similitude entropy (SE), is proposed. It can be used to evaluate the compactness and separation of clusters with information entropy theory. Simulations including the performance comparison between the proposed method and the conventional methods are presented. Results show that while maintaining the sorting accuracy, the proposed method can reduce the computing complexity effectively in sorting the signals. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
39. Support vector description of clusters for content-based image annotation.
- Author
-
Sun, Liang, Ge, Hongwei, Yoshida, Shinichi, Liang, Yanchun, and Tan, Guozhen
- Subjects
- *
SUPPORT vector machines , *CONTENT-based image retrieval , *COMPUTER vision , *MACHINE learning , *PROBABILITY theory , *SIMULATION methods & models - Abstract
Abstract: Continual progress in the fields of computer vision and machine learning has provided opportunities to develop automatic tools for tagging images; this facilitates searching and retrieving. However, due to the complexity of real-world image systems, effective and efficient image annotation is still a challenging problem. In this paper, we present an annotation technique based on the use of image content and word correlations. Clusters of images with manually tagged words are used as training instances. Images within each cluster are modeled using a kernel method, in which the image vectors are mapped to a higher-dimensional space and the vectors identified as support vectors are used to describe the cluster. To measure the extent of the association between an image and a model described by support vectors, the distance from the image to the model is computed. A closer distance indicates a stronger association. Moreover, word-to-word correlations are also considered in the annotation framework. To tag an image, the system predicts the annotation words by using the distances from the image to the models and the word-to-word correlations in a unified probabilistic framework. Simulated experiments were conducted on three benchmark image data sets. The results demonstrate the performance of the proposed technique, and compare it to the performance of other recently reported techniques. [Copyright &y& Elsevier]
- Published
- 2014
- Full Text
- View/download PDF
40. A new approach for rule extraction of expert system based on SVM.
- Author
-
Li, Ai and Chen, Guo
- Subjects
- *
EXPERT systems , *SUPPORT vector machines , *GENETIC algorithms , *FEATURE extraction , *SAMPLING (Process) , *AIRPLANE motors - Abstract
Highlights: [•] A new approach is proposed to extract knowledge rules from Support Vector Clustering. [•] Use Genetic Algorithm to choose the features of the sample data. [•] SMOTE algorithm is adopted to resample fault samples. [•] The new method is used to extract knowledge rules for aero-engine oil monitoring expert system. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
41. Support vector clustering‐based direct coherency identification of generators in a multi‐machine power system.
- Author
-
Agrawal, Rimjhim and Thukaram, Dhadbanjan
- Abstract
This study investigates the application of support vector clustering (SVC) for the direct identification of coherent synchronous generators in large interconnected multi‐machine power systems. The clustering is based on coherency measure, which indicates the degree of coherency between any pair of generators. The proposed SVC algorithm processes the coherency measure matrix that is formulated using the generator rotor measurements to cluster the coherent generators. The proposed approach is demonstrated on IEEE 10 generator 39‐bus system and an equivalent 35 generators, 246‐bus system of practical Indian southern grid. The effect of number of data samples and fault locations are also examined for determining the accuracy of the proposed approach. An extended comparison with other clustering techniques is also included, to show the effectiveness of the proposed approach in grouping the data into coherent groups of generators. This effectiveness of the coherent clusters obtained with the proposed approach is compared in terms of a set of clustering validity indicators and in terms of statistical assessment that is based on the coherency degree of a generator pair. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
42. SVStream: A Support Vector-Based Algorithm for Clustering Data Streams.
- Author
-
Chang-Dong Wang, Jian-Huang Lai, Dong Huang, and Wei-Shi Zheng
- Subjects
- *
STREAMING technology , *DOCUMENT clustering , *SUPPORT vector machines , *DATA analysis , *KERNEL operating systems - Abstract
In this paper, we propose a novel data stream clustering algorithm, termed SVStream, which is based on support vector domain description and support vector clustering. In the proposed algorithm, the data elements of a stream are mapped into a kernel space, and the support vectors are used as the summary information of the historical elements to construct cluster boundaries of arbitrary shape. To adapt to both dramatic and gradual changes, multiple spheres are dynamically maintained, each describing the corresponding data domain presented in the data stream. By allowing for bounded support vectors (BSVs), the proposed SVStream algorithm is capable of identifying overlapping clusters. A BSV decaying mechanism is designed to automatically detect and remove outliers (noise). We perform experiments over synthetic and real data streams, with the overlapping, evolving, and noise situations taken into consideration. Comparison results with state-of-the-art data stream clustering methods demonstrate the effectiveness and efficiency of the proposed method. [ABSTRACT FROM PUBLISHER]
- Published
- 2013
- Full Text
- View/download PDF
43. Proximity multi-sphere support vector clustering.
- Author
-
Le, Trung, Tran, Dat, Nguyen, Phuoc, Ma, Wanli, and Sharma, Dharmendra
- Subjects
- *
SUPPORT vector machines , *CLUSTER analysis (Statistics) , *GRAPH theory , *CLASSIFICATION algorithms , *DISTRIBUTION (Probability theory) , *PROXIMITY matrices - Abstract
Support vector data description constructs an optimal hypersphere in feature space as a description of a data set. This hypersphere when mapped back to input space becomes a set of contours, and support vector clustering (SVC) employs these contours as cluster boundaries to detect clusters in the data set. However real-world data sets may have some distinctive distributions and hence a single hypersphere cannot be the best description. As a result, the set of contours in input space does not always detect all clusters in the data set. Another issue in SVC is that in some cases, it cannot preserve proximity notation which is crucial for cluster analysis, that is, two data points that are close to each other can be assigned to different clusters using cluster labelling method of SVC. To overcome these drawbacks, we propose Proximity Multi-sphere Support Vector Clustering which employs a set of hyperspheres to provide a better data description for data sets having distinctive distributions and a proximity graph to favour the proximity notation. Experimental results on different data sets are presented to evaluate the proposed clustering technique and compare it with SVC and other clustering techniques. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
44. Position regularized Support Vector Domain Description
- Author
-
Wang, Chang-Dong and Lai, JianHuang
- Subjects
- *
SUPPORT vector machines , *ALGORITHMS , *PARAMETER estimation , *DATA analysis , *APPLICATION software , *COMPUTATIONAL complexity - Abstract
Abstract: Support Vector Domain Description (SVDD) is an effective method for describing a set of objects. As a basic tool, several application-oriented extensions have been developed, such as support vector clustering (SVC), SVDD-based k-Means (SVDDk-Means) and support vector based algorithm for clustering data streams (SVStream). Despite its significant success, one inherent drawback is that the description is very sensitive to the selection of the trade-off parameter, which is hard to estimate in practice and affects the extensive approaches significantly. To tackle this problem, we propose a novel Position regularized Support Vector Domain Description (PSVDD). In the proposed PSVDD, the complexity of the sphere surface is adaptively regularized by assigning a position-based weighting to each data point, which is computed according to the distance between the corresponding feature space image and the mean of feature space images. To demonstrate the effectiveness of the proposed PSVDD, we apply the position-based weighting to improve two important clustering extensions, i.e., SVC and SVDDk-Means, which respectively result in two new clustering approaches termed PSVC and PSVDDk-Means. Experimental results on several real-world data sets validate the significant improvement achieved by PSVC and PSVDDk-Means. [Copyright &y& Elsevier]
- Published
- 2013
- Full Text
- View/download PDF
45. A support vector clustering-based probabilistic method for unsupervised fault detection and classification of complex chemical processes using unlabeled data.
- Author
-
Yu, Jie
- Subjects
SUPPORT vector machines ,PROBABILITY theory ,CHEMICAL processes ,DATA analysis ,FAULT diagnosis ,FISHER discriminant analysis - Abstract
A new support vector clustering ( SVC)-based probabilistic approach is developed for unsupervised chemical process monitoring and fault classification in this article. The spherical centers and radii of different clusters corresponding to normal and various kinds of faulty operations are estimated in the kernel feature space. Then the geometric distance of the monitored samples to different cluster centers and boundary support vectors are computed so that the distance-ratio-based probabilistic-like index can be further defined. Thus, the most probable clusters can be assigned to the monitored samples for fault detection and classification. The proposed SVC monitoring approach is applied to two test scenarios in the Tennessee Eastman Chemical process and its results are compared to those of the conventional K-nearest neighbor Fisher discriminant analysis ( KNN-FDA) and K-nearest neighbor support vector machine ( KNN-SVM) methods. The result comparison demonstrates the superiority of the SVC-based probabilistic approach over the traditional KNN-FDA and KNN-SVM methods in terms of fault detection and classification accuracies. © 2012 American Institute of Chemical Engineers AIChE J, 59: 407-419, 2013 [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
46. MMSVC: An efficient unsupervised learning approach for large-scale datasets
- Author
-
Gu, Hong, Zhao, Guangzhou, and Zhang, Jianliang
- Subjects
- *
SUPPORT vector machines , *CLUSTER analysis (Statistics) , *SPHERICAL data , *SCALABILITY , *LARGE scale systems , *ALGORITHMS , *NUMERICAL analysis - Abstract
Abstract: We propose a multi-scale, hierarchical framework to extend the scalability of support vector clustering (SVC). Based on multi-sphere support vector clustering, the clustering algorithm called multi-scale multi-sphere support vector clustering (MMSVC) works in a coarse-to-fine and top-to-down manner. Given one parent cluster, the next learning scale is generated by a secant-like numerical algorithm. A local quantity called spherical support vector density (sSVD) is proposed as a cluster validity measure to describe the compactness of the cluster. It is used as a terminate term in our framework. When dealing with large-scale dataset, our method benefits from the easy parameters tuning (robustness of parameters with respect to the clustering result) and the learning efficiency. We took 1.5 million tiny images to evaluate the method. Experimental result demonstrated that our method greatly improved the scalability and learning efficiency of support vector clustering. [Copyright &y& Elsevier]
- Published
- 2012
- Full Text
- View/download PDF
47. Gaussian kernel width exploration and cone cluster labeling for support vector clustering.
- Author
-
Lee, Sei-Hyung and Daniels, Karen
- Subjects
- *
GAUSSIAN processes , *KERNEL functions , *SUPPORT vector machines , *MACHINE learning , *CLUSTER analysis (Statistics) - Abstract
The process of clustering groups together data points so that intra-cluster similarity is maximized while inter-cluster similarity is minimized. Support vector clustering (SVC) is a clustering approach that can identify arbitrarily shaped cluster boundaries. The execution time of SVC depends heavily on several factors: choice of the width of a kernel function that determines a nonlinear transformation of the input data, solution of a quadratic program, and the way that the output of the quadratic program is used to produce clusters. This paper builds on our prior SVC research in two ways. First, we propose a method for identifying a kernel width value in a region where our experiments suggest that clustering structure is changing significantly. This can form the starting point for efficient exploration of the space of kernel width values. Second, we offer a technique, called cone cluster labeling, that uses the output of the quadratic program to build clusters in a novel way that avoids an important deficiency present in previous methods. Our experimental results use both two-dimensional and high-dimensional data sets. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
48. Technology forecasting using matrix map and patent clustering.
- Author
-
Sunghae Jun, Park, Sang Sung, and Jang, Dong Sik
- Subjects
FORECASTING ,TECHNOLOGICAL innovations ,MATHEMATICAL mappings ,SUPPORT vector machines ,DOCUMENT clustering ,STRATEGIC planning ,PATENTS ,MATHEMATICAL models - Abstract
Purpose – The purpose of this paper is to propose an objective method for technology forecasting (TF). For the construction of the proposed model, the paper aims to consider new approaches to patent mapping and clustering. In addition, the paper aims to introduce a matrix map and K-medoids clustering based on support vector clustering (KM-SVC) for vacant TF. Design/methodology/approach – TF is an important research and development (R&D) policy issue for both companies and government. Vacant TF is one of the key technological planning methods for improving the competitive power of firms and governments. In general, a forecasting process is facilitated subjectively based on the researcher's knowledge, resulting in unstable TF performance. In this paper, the authors forecast the vacant technology areas in a given technology field by analyzing patent documents and employing the proposed matrix map and KM-SVC to forecast vacant technology areas in the management of technology (MOT). Findings – The paper examines the vacant technology areas for MOT patent documents from the USA, Europe, and China by comparing these countries in terms of technology trends in MOT and identifying the vacant technology areas by country. The matrix map provides broad vacant technology areas, whereas KM-SVC provides more specific vacant technology areas. Thus, the paper identifies the vacant technology areas of a given technology field by using the results for both the matrix map and KM-SVC. Practical implications – The authors use patent documents as objective data to develop a model for vacant TF. The paper attempts to objectively forecast the vacant technology areas in a given technology field. To verify the performance of the matrix map and KM-SVC, the authors conduct an experiment using patent documents related to MOT (the given technology field in this paper). The results suggest that the proposed forecasting model can be applied to diverse technology fields, including R&D management, technology marketing, and intellectual property management. Originality/value – Most TF models are based on qualitative and subjective methods such as Delphi. That is, there are few objective models. In this regard, this paper proposes a quantitative and objective TF model that employs patent documents as objective data and a matrix map and KM-SVC as quantitative methods. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
49. A NOVEL SCHEME FOR ACCELERATING SUPPORT VECTOR CLUSTERING.
- Author
-
Yuan Ping, Yajian Zhou, Yixian Yang, and Habala, Ondrej
- Subjects
MATHEMATICAL optimization ,CLUSTER analysis (Statistics) ,VECTOR analysis ,CENTROIDAL Voronoi tessellations ,NOISE control - Abstract
Limited by two time-consuming steps, solving the optimization problem and labeling the data points with cluster labels, the support vector clustering (SVC) based algorithms, perform ineffectively in processing large datasets. This paper presents a novel scheme aimed at solving these two problems and accelerating the SVC. Firstly, an innovative definition of noise data points is proposed which can be applied in the design of noise elimination to reduce the size of a data set as well as to improve its separability without destroying the profile. Secondly, in the cluster labeling, a double centroids (DBC) labeling method, representing each cell of a cluster by the centroids of shape and density, is presented. This method is implemented towards accelerating this procedure and addressing the problem of labeling the original data set with irregular or imbalanced distribution. Compared with the state-of-the-art algorithms, the experimental results show that the proposed method significantly reduces the computational resources and improves the accuracy. Further analysis and experiments of semi-supervised cluster labeling confirm that the proposed DBC model is suitable for representing cells in clustering. [ABSTRACT FROM AUTHOR]
- Published
- 2012
50. Convex Decomposition Based Cluster Labeling Method for Support Vector Clustering.
- Author
-
Ping, Yuan, Tian, Ying-Jie, Zhou, Ya-Jian, and Yang, Yi-Xian
- Subjects
CONVEX programming ,ELECTRONIC file management ,IMAGE processing ,GAUSSIAN processes ,CONVEX domains - Abstract
Support vector clustering (SVC) is an important boundary-based clustering algorithm in multiple applications for its capability of handling arbitrary cluster shapes. However, SVC's popularity is degraded by its highly intensive time complexity and poor label performance. To overcome such problems, we present a novel efficient and robust convex decomposition based cluster labeling (CDCL) method based on the topological property of dataset. The CDCL decomposes the implicit cluster into convex hulls and each one is comprised by a subset of support vectors (SVs). According to a robust algorithm applied in the nearest neighboring convex hulls, the adjacency matrix of convex hulls is built up for finding the connected components; and the remaining data points would be assigned the label of the nearest convex hull appropriately. The approach's validation is guaranteed by geometric proofs. Time complexity analysis and comparative experiments suggest that CDCL improves both the efficiency and clustering quality significantly. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.