10,626 results on '"classification methods"'
Search Results
252. An automatic radar based aerial target recognition framework.
- Author
-
Agnihotri, Vikas and Sabharwal, Munish
- Subjects
- *
AUTOMATIC target recognition , *ROTATIONAL motion , *SUPPORT vector machines , *CRUISE missiles , *AIRSHIPS , *AIRPLANE ambulances , *RADAR - Abstract
Helicopters and Air Launched Cruise Missiles (ALCM) usually move at constant speeds as compared with other aerial targets. The distinguishing features for helicopters are the rotatory motions of its main and tail rotors whereas for ALCM are their rotational motion and wings. The present study proposes an automatic target recognition system framework for recognition of aerial targets, specifically Helicopters and ALCM. The study generates simulations of Micro-Doppler signatures for helicopters and ALCMs, followed by identification of their unique features for their classification. The study further applies various classification techniques on simulated dataset so produced to identify the aerial target. The classification results so produced by various classifiers are analyzed and compared, the best results to identify the Helicopters and ALCM, using their Micro-Doppler signatures, are produced by Support Vector Machine classifier. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
253. Data-Driven Gearbox Failure Detection in Industrial Robots.
- Author
-
Vallachira, Sathish, Orkisz, Michal, Norrlof, Mikael, and Butail, Sachit
- Abstract
Gearbox failures cost thousands of lost production hours in plants that use industrial robots. In this context, an automated monitoring system that can warn the user of an impending failure can save precious resources. This problem has been addressed in many other domains through the use of machine learning approaches. However, standard machine learning algorithms are limited in their ability to detect gearbox failures, mainly due to task variability arises from robot-specific data. To improve detection performance of machine learning approaches, in this paper we propose techniques to curate the data prior to building a classification model. In a systematic hypothesis–driven study exploring the effect of different preprocessing techniques, we evaluate training data augmentation with estimated measurements, data differencing to suppress task dependence, inclusion of local variation, and selection of principal components on data collected from 26 industrial robots from the field. Our results show that preprocessing techniques improve the failure detection performance. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
254. Exploration of Feature Extraction Methods and Dimension for sEMG Signal Classification.
- Author
-
Wu, Yutong, Hu, Xinhui, Wang, Ziwei, Wen, Jian, Kan, Jiangming, and Li, Wenbin
- Subjects
SIGNAL classification ,FEATURE extraction ,PRUNING ,PRINCIPAL components analysis ,SCIENCE databases ,DISCRIMINANT analysis ,AUTOMATIC control systems - Abstract
It is necessary to complete the two parts of gesture recognition and wireless remote control to realize the gesture control of the automatic pruning machine. To realize gesture recognition, in this paper, we have carried out the research of gesture recognition technology based on surface electromyography signal, and discussed the influence of different numbers and different gesture combinations on the optimal size. We have calculated the 630-dimensional eigenvector from the benchmark scientific database of sEMG signals and extracted the features using principal component analysis (PCA). Discriminant analysis (DA) has been used to compare the processing effects of each feature extraction method. The experimental results have shown that the recognition rate of four gestures can reach 100.0%, the recognition rate of six gestures can reach 98.29%, and the optimal size is 516~523 dimensions. This study lays a foundation for the follow-up work of the pruning machine gesture control, and p rovides a compelling new way to promote the creative and human computer interaction process of forestry machinery. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
255. Impacto de variables socioeconómicas en el desempeño estudiantil.
- Author
-
Jiménez Ramirez, Manuel Arturo, Parra, Edward, and Briceño, Marianela Luzardo
- Subjects
WASHING machines ,WRITTEN communication ,HIGHER education ,TEST scoring ,DATA analysis - Abstract
Copyright of Comunicaciones en Estadística is the property of Universidad Santo Tomas and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2019
256. The Influence of Data Classification Methods on Predictive Accuracy of Kernel Density Estimation Hotspot Maps.
- Author
-
Milic, Nenad, Popovic, Brankica, Mijalkovic, Sasa, and Marinkovic, Darko
- Published
- 2019
257. Joint distribution matching model for distribution–adaptation‐based cross‐project defect prediction.
- Author
-
Qiu, Shaojian, Lu, Lu, and Jiang, Siyu
- Abstract
Using classification methods to predict software defect is receiving a great deal of attention and most of the existing studies primarily conduct prediction under the within‐project setting. However, there usually had no or very limited labelled data to train an effective prediction model at an early phase of the software lifecycle. Thus, cross‐project defect prediction (CPDP) is proposed as an alternative solution, which is learning a defect predictor for a target project by using labelled data from a source project. Differing from previous CPDP methods that mainly apply instances selection and classifiers adjustment to improve the performance, in this study, the authors put forward a novel distribution–adaptation‐based CPDP approach, joint distribution matching (JDM). Specifically, JDM aims to minimise the joint distribution divergence between the source and target project to improve the CPDP performance. By constructing an adaptive weight vector for the instances of the source project, JDM can be effective and robust at reducing marginal distribution discrepancy and conditional distribution discrepancy simultaneously. Extensive experiments verify that JDM can outperform related distribution–adaptation‐based methods on 15 open‐source projects that are derived from two types of repositories. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
258. Modeling of Decision Trees Through P Systems.
- Author
-
Sempere, José M.
- Subjects
- *
DECISION trees , *TOPOLOGY - Abstract
In this paper, we propose a decision-tree modeling in the framework of membrane computing. We propose an algorithm to obtain a P system that is equivalent to any decision tree taken as input. In our case, and unlike previous proposals, we formulate the concepts of decision trees endogenously, since there is no external agent involved in the modeling. The tree structure can be defined naturally by the topology of the regions in the P system and the decision rules are defined by communication rules of the P system. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
259. Systemic early warning systems for EU14 based on the 2008 crisis: proposed estimation and model assessment for classification forecasting.
- Author
-
Papadopoulos, Savas, Stavroulias, Pantelis, and Sager, Thomas
- Subjects
ECONOMIC forecasting ,BANKING laws ,CRISES ,FINANCIAL crises ,PREDICTION models - Abstract
Reliable forecasts of an economic crisis well in advance of its onset could permit effective preventative measures to mitigate its consequences and become a valuable tool for banking regulation and macroprudential policy. Using the EU14 crisis of 2007–2008 as a template, we develop methodology that can accurately predict a banking crisis several quarters in advance in each country. The data for our predictions are standard, publicly available macroeconomic and market variables that are preprocessed by moving averages and filtering. The prediction models then utilize the filtered data to distinguish pre-crisis from normal quarters through standard statistical classification methodology plus one proposed method, enhanced by an innovative goodness-of-fit measure used in the estimation and in the threshold selection. Empirical results are quite satisfactory and can be used by policy makers, investors and researchers who are interested in estimating the probability of a crisis as much as one and a half years in advance in order to deploy prudential policies. Implications to bank regulatory policy are discussed. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
260. Region‐division‐based joint sparse representation classification for hyperspectral images.
- Author
-
Yan, Jingwen, Chen, Hongda, Zhai, Yikui, Liu, Yinan, and Liu, Lei
- Abstract
In this article, a region‐division‐based joint sparse representation classification (RDJSRC) method is proposed to solve the heterogeneous region problem in the joint sparse representation classification (JSRC) method used in hyperspectral image (HSI) classification. The RDJSRC method incorporates regional information, obtained by the hidden Markov random field (HMRF), into the JSRC to reduce the interference of heterogeneous pixels in the neighbourhood of the test pixel and finally improve the classification performance. The framework of this method is as follows. The first several principal components (PCs) are initially selected to be the new HSI by transforming the original HSI with the PC analysis algorithm. Then, the regional information containing the spatial structure of the HSI is obtained by applying the HMRF algorithm to the first PC. Through incorporating this regional information into the JSRC procedure, the initial label of the test pixel can be jointly determined by the new HSI pixels within the homogeneity in the search window. Ultimately, the final label of the test pixel is determined by a voting strategy based on multiple classification results. Compared with several classification methods, experimental results, indicate that this method achieves improvement from 2 to 3% in HSI classification. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
261. Evaluación de riesgos con Data Mining: el sistema financiero español.
- Author
-
Fernández Fernández, José Alejandro, Bejarano Vázquez, Virginia, and Virseda, Juan Antonio Vicente
- Abstract
Copyright of Mexican Journal of Economics & Finance / Revista Mexicana de Economia y Finanzas is the property of Instituto Mexicano de Ejecutivos de Finanzas and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2019
- Full Text
- View/download PDF
262. Motor Seslerine Göre Otomobillerin Tanınması.
- Author
-
Karaman, Efecan, Rende, Hikmet, and Akşahin, Mehmet Feyzi
- Abstract
Almost every moving vehicle produces a kind of sound. Automobile engine and powertrain, tires, exhaust, gearbox and suspension and other. sounds constitute significant sound sources in vehicles. These sounds, which are formed in vehicles, involve identifiable features that distinguish automobiles from each other like signatures belonging to the person. In this study, it was aimed to identify and different types of vehicles by using their engine sounds. These sounds were processed and then these sounds were classified by using an artificial neural networks classification method. As a result, 99.2% success was achieved and the models of the vehicles were recognized. [ABSTRACT FROM AUTHOR]
- Published
- 2019
263. Fuzzy Rule-Based Classification Method for Incremental Rule Learning
- Author
-
Hui Wang, Degang Chen, Jinhai Li, and Jiaojiao Niu
- Subjects
Reduct ,Fuzzy rule ,Computer science ,Applied Mathematics ,Granular computing ,Perspective (graphical) ,computer.software_genre ,Fuzzy logic ,Data type ,Task (project management) ,ComputingMethodologies_PATTERNRECOGNITION ,Computational Theory and Mathematics ,Artificial Intelligence ,Control and Systems Engineering ,Classification methods ,Data mining ,computer - Abstract
Granular rules have been extensively used for classification in fuzzy datasets to promote the advancement of artificial intelligence. However, due to the diversity of data types, how to improve the readability of the extracted granular rules while ensuring efficiency is always a challenge. Since granular reduct in granular computing can simplify real complex problem and dataset, this paper carries out granular rule learning from the perspective of granular reduct by taking FCA-based granular computing method as a framework. Specifically, for achieving classification task, we first propose a method to update the granular reduct, and then explore the updating mechanism of fuzzy granular rule in a reduced dataset. Secondly, a novel Fuzzy Rule based Classification Model named FRCM is presented for fuzzy granular rule learning. In order to verify the effectiveness of the proposed model, some numerical experiments for incremental learning and fuzzy rule mining are conducted to demonstrate that FRCM can achieve the state-of-the-art classification performance.
- Published
- 2022
- Full Text
- View/download PDF
264. Multi-Feature Fusion Method for Identifying Carotid Artery Vulnerable Plaque
- Author
-
R. Wu, L. Huang, Jiang Xie, G. Ding, M. Chi, X. Xu, Wenjun Zhang, and L. Liu
- Subjects
Carotid atherosclerosis ,business.industry ,Computer science ,Carotid arteries ,Biomedical Engineering ,Biophysics ,Pattern recognition ,medicine.disease_cause ,Convolutional neural network ,Vulnerable plaque ,Identification (information) ,Multi feature fusion ,medicine ,Classification methods ,Artificial intelligence ,business ,Feature set - Abstract
Purpose Vulnerable plaque of carotid atherosclerosis is prone to rupture, which can easily lead to acute cardiovascular and cerebrovascular accidents. Accurate identification of the vulnerable plaque is a challenging task, especially on limited datasets. Methods This paper proposes a multi-feature fusion method to identify high-risk plaque, in which three types of features are combined, i.e. global features of carotid ultrasound images, echo features of regions of interests (ROI) and expert knowledge from ultrasound reports. Due to the fusion of three types of features, more critical features for identifying high-risk plaque are included in the feature set. Therefore, better performance can be achieved even on limited datasets. Results From testing all combinations of three types of features, the results showed that the accuracy of using all three types of features is the highest. The experiments also showed that the performance of the proposed method is better than other plaque classification methods and classical Convolutional Neural Networks (CNNs) on the Plaque dataset. Conclusion The proposed method helped to build a more complete feature set so that the machine learning models could identify vulnerable plaque more accurately even on datasets with poor quality and small scale.
- Published
- 2022
- Full Text
- View/download PDF
265. Smart meter data classification using optimized random forest algorithm
- Author
-
Alireza Zakariazadeh
- Subjects
Smart meter ,business.industry ,Computer science ,Applied Mathematics ,Data classification ,Regular polygon ,computer.software_genre ,Computer Science Applications ,Random forest ,Electricity ,Control and Systems Engineering ,Classifier (linguistics) ,Cluster Analysis ,Classification methods ,Data mining ,Electrical and Electronic Engineering ,Cluster analysis ,business ,Instrumentation ,computer ,Algorithms - Abstract
Implementing a proper clustering algorithm and a high accuracy classifier for applying on electricity smart meter data is the first stage in analyzing and managing electricity consumption. In this paper, Random Forest (RF) classifier optimized by Artificial Bee Colony (ABC) which is called Artificial Bee Colony-based Random Forest (ABC-RF) is proposed. Also, in order to determine the representative load curves, the Convex Clustering (CC) is used. The solution paths generated by convex clustering show relationships among clusters that were hidden by static methods such as k-means clustering. To validate the proposed method, a case study that includes a real dataset of residential smart meters is implemented. The results evidence that the proposed ABC-RF method provides a higher accuracy if compared to other classification methods.
- Published
- 2022
- Full Text
- View/download PDF
266. Surrounding Vehicles’ Lane Change Maneuver Prediction and Detection for Intelligent Vehicles: A Comprehensive Review
- Author
-
Ruitao Song and Bin Li
- Subjects
Computer science ,business.industry ,Mechanical Engineering ,Inference ,ComputerApplications_COMPUTERSINOTHERSYSTEMS ,Feature selection ,Machine learning ,computer.software_genre ,Computer Science Applications ,User experience design ,Automotive Engineering ,Evaluation methods ,Classification methods ,Artificial intelligence ,business ,computer - Abstract
Identifying and evaluating the potential risks in the surrounding environment is critical for intelligent vehicles' safety and user experience. This paper provides a comprehensive overview of the state-of-the-art research on the surrounding vehicles' lane change maneuver prediction and detection. First, various driver behavior modeling and classification methods are reviewed and analyzed, which gives a general understanding of what the lane change maneuver is and how to predict or detect the lane change maneuver. Next, the primary sensing devices equipped on intelligent vehicles and their impacts on lane change inference systems are discussed. Then, a series of representative research works in recent years are selected, introduced, and compared regarding their input feature selection, inference algorithms, and performance evaluation methods. Finally, some potential future research directions are proposed. This paper aims to help the relevant researchers and institutions summarize the current studies on the surrounding vehicles' lane change maneuver inference and recognize its future development directions.
- Published
- 2022
- Full Text
- View/download PDF
267. Face Recognition Using Popular Deep Net Architectures: A Brief Comparative Study
- Author
-
Tony Gwyn, Kaushik Roy, and Mustafa Atay
- Subjects
Convolutional Neural Networks ,authentication ,biometrics ,face biometrics ,facial recognition ,classification methods ,Information technology ,T58.5-58.64 - Abstract
In the realm of computer security, the username/password standard is becoming increasingly antiquated. Usage of the same username and password across various accounts can leave a user open to potential vulnerabilities. Authentication methods of the future need to maintain the ability to provide secure access without a reduction in speed. Facial recognition technologies are quickly becoming integral parts of user security, allowing for a secondary level of user authentication. Augmenting traditional username and password security with facial biometrics has already seen impressive results; however, studying these techniques is necessary to determine how effective these methods are within various parameters. A Convolutional Neural Network (CNN) is a powerful classification approach which is often used for image identification and verification. Quite recently, CNNs have shown great promise in the area of facial image recognition. The comparative study proposed in this paper offers an in-depth analysis of several state-of-the-art deep learning based-facial recognition technologies, to determine via accuracy and other metrics which of those are most effective. In our study, VGG-16 and VGG-19 showed the highest levels of image recognition accuracy, as well as F1-Score. The most favorable configurations of CNN should be documented as an effective way to potentially augment the current username/password standard by increasing the current method’s security with additional facial biometrics.
- Published
- 2021
- Full Text
- View/download PDF
268. Forest canopy density assessment using different approaches - Review
- Author
-
Azadeh ABDOLLAHNEJAD, Dimitrios PANAGIOTIDIS, and Peter SUROVÝ
- Subjects
satellite sensors ,vegetation index ,classification methods ,pixel based ,spectral analyses ,Forestry ,SD1-669.5 - Abstract
Crown canopy is a significant regulator of forest, affecting microclimate, soil conditions and having an undeniable role in a forest ecosystem. Among the different materials and approaches that have been used for the estimation of crown canopy, satellite based methods are among the most successful methods regarding cost-saving efforts and different kinds of options for measuring the crown canopy. Different types of satellite sensors can result in different outputs due to their various spectral and spatial resolution, even when using the same methodologies. The aim of this review is to assess different remote sensing methods for forest crown canopy density assessment.
- Published
- 2017
- Full Text
- View/download PDF
269. A technique of the structural-tectonic elevations prediction using Earth remote sensing data
- Author
-
I. V. Tishaev, V. I. Zatserkovnyi, and K. P. Yagorlytska
- Subjects
remote sensing materials (RSM) ,geo-information systems (GIS) ,neotectonic elevations ,hydra-carbons ,classification methods ,Astronomy ,QB1-991 - Abstract
We consider an approach of using methods of Earth remote sensing data (RSD) classification for solving tasks of exploration geology and geophysics. Information obtained from the remote sensing data gives a possibility to clarify the structure of investigated areas and to determine neotectonic elevations, which act as certain indicators of promising areas with hydra-carbons contents. Reasonability of using such methods of RSD classification is based on connection between deep structure of surface resources (structural-tectonic setting) with current landscape, character of hydrologic network, geo-morphological, geo-botanical and other features. The advantage of Bayes classificator is not only in determination of object belonging to certain class, but also in calculation of probability of such belonging. For the formulated task this lets to forecast a presence of structural-tectonic elevations, which are potentially promising areas for hydra-carbons contents, using a formalized quantitative criterion. contents.
- Published
- 2016
- Full Text
- View/download PDF
270. New opportunities of the domestic universal classifications (LBC case)
- Author
-
N. Yu. Sokolova
- Subjects
library-bibliographic classification ,classification systems ,general scientific knowledge ,interdisciplinary knowledge ,formal sciences ,terminology ,classification methods ,classification principles ,Bibliography. Library science. Information resources - Abstract
The article deals with the problem of further development of the library-bibliographic classification (LBC), in particular, the issue of methodological development of Department 1 «General scientific and interdisciplinary knowledge». This topic raised by experts in late 1980s is still the subject of discussion both in the professional library and in the scientific community. The article presents a proposal on this issue compiled by specialists of the Center for studying informatics problems in INION RAS based of discussions and a subsequent report by LBC Chief Editor E. R. Sukiasyan. General positions of the Scientific-Methodological Council of the Research-Publishing Center on LSC development in the Russian Sate Library adopted as the results of the scientific community discussions are presented.
- Published
- 2016
- Full Text
- View/download PDF
271. A hierarchical interval outranking approach with interacting criteria
- Author
-
Efrain Solares, Jorge Navarro, and Eduardo Fernandez
- Subjects
Information Systems and Management ,Theoretical computer science ,General Computer Science ,Hierarchy (mathematics) ,Generalization ,Computer science ,Sorting ,Model parameters ,Interval (mathematics) ,Management Science and Operations Research ,Industrial and Manufacturing Engineering ,Set (abstract data type) ,Development (topology) ,Modeling and Simulation ,Classification methods - Abstract
Complex decision-making problems can be conveniently decomposed in more manageable sub-problems through a hierarchical approach. Often, there are interaction effects between sub-criteria or elementary criteria in the hierarchy. In this paper, we propose a generalization of the recently published interval outranking approach to deal with this kind of problems. This new method allows an easy setting of weights (including interaction weights), and other model parameters as interval numbers. The proposal also allows to set outranking relations associated with each non-elementary criterion. On this background, the INTERCLASS-nB and INTERCLASS-nC multi-criteria ordinal classification (multi-criteria sorting) methods are generalized to cope with interacting and hierarchical criteria; these methods can suggest assignments of actions to ordered classes for each non-elementary criterion. Finally, an application of the classification methods is illustrated by evaluating research and development projects.
- Published
- 2022
- Full Text
- View/download PDF
272. Segmentation of Passenger Electric Cars Market in Poland
- Author
-
Jakub Kubiczek and Bartłomiej Hadasik
- Subjects
market segmentation ,classification methods ,EV market ,passenger electric cars ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 ,Transportation engineering ,TA1001-1280 - Abstract
Striving to achieve sustainable development goals and taking care of the environment into the policies of car manufacturers forced the search for alternative sources of vehicle propulsion. One way to implement a sustainable policy is to use electric motors in cars. The observable development of the electric car market provides consumers with a wide spectrum of choices for a specific model that would meet their expectations. Currently, there are 53 different electric car models on the primary market in Poland. The aim of the article was to present the performed market segmentation, focused on identifying the similarities in the characteristics of electric car models on the Polish market and proposing their groupings. Based on the classification by the hierarchical cluster analysis algorithm (Ward’s method, squared Euclidean distance), the market division into 2, 3, and 4 groups was proposed. The Polish EV market segmentation took place not only in terms of the size and class of the car but primarily in terms of performance and overall quality of the vehicle. The performed classification did not change when the price was additionally included as a variable. It was also proposed to divide the market into 4 segments named: Premium, City, Small, and Sport. The segmentation carried out in this way helps to better understand the structure of the electric car market.
- Published
- 2021
- Full Text
- View/download PDF
273. Research methods used in library and information science during the 1970-2010
- Author
-
Veronica Gauchi Risso
- Published
- 2016
- Full Text
- View/download PDF
274. A Comparative Study of Cancer Classification Methods Using Microarray Gene Expression Profile
- Author
-
Alshamlan, Hala, Badr, Ghada, Alohali, Yousef, Herawan, Tutut, editor, Deris, Mustafa Mat, editor, and Abawajy, Jemal, editor
- Published
- 2014
- Full Text
- View/download PDF
275. Towards an Ensemble Learning Strategy for Metagenomic Gene Prediction
- Author
-
Goés, Fabiana, Alves, Ronnie, Corrêa, Leandro, Chaparro, Cristian, Thom, Lucinéia, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Kobsa, Alfred, Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Istrail, Sorin, Series editor, Pevzner, Pavel, Series editor, Waterman, Michael S., Series editor, and Campos, Sérgio, editor
- Published
- 2014
- Full Text
- View/download PDF
276. Application of Machine Learning for the Automation of the Quality Control of Noise Filtering Processes in Seismic Data Imaging
- Author
-
Mohamed Mejri and Maiza Bekara
- Subjects
QC denoise automation ,feature transformation techniques ,classification methods ,Geology ,QE1-996.5 - Abstract
Seismic imaging is the main technology used for subsurface hydrocarbon prospection. It provides an image of the subsurface using the same principles as ultrasound medical imaging. As for any data acquired through hydrophones (pressure sensors) and/or geophones (velocity/acceleration sensors), the raw seismic data are heavily contaminated with noise and unwanted reflections that need to be removed before further processing. Therefore, the noise attenuation is done at an early stage and often while acquiring the data. Quality control (QC) is mandatory to give confidence in the denoising process and to ensure that a costly data re-acquisition is not needed. QC is done manually by humans and comprises a major portion of the cost of a typical seismic processing project. It is therefore advantageous to automate this process to improve cost and efficiency. Here, we propose a supervised learning approach to build an automatic QC system. The QC system is an attribute-based classifier that is trained to classify three types of filtering (mild = under filtering, noise remaining in the data; optimal = good filtering; harsh = over filtering, the signal is distorted). The attributes are computed from the data and represent geophysical and statistical measures of the quality of the filtering. The system is tested on a full-scale survey (9000 km2) to QC the results of the swell noise attenuation process in marine seismic data.
- Published
- 2020
- Full Text
- View/download PDF
277. Classification Methods for Airborne Disease Spores from Greenhouse Crops Based on Multifeature Fusion
- Author
-
Yafei Wang, Xiaoxue Du, Guoxin Ma, Yong Liu, Bin Wang, and Hanping Mao
- Subjects
greenhouse ,disease ,airborne spore ,classification methods ,Technology ,Engineering (General). Civil engineering (General) ,TA1-2040 ,Biology (General) ,QH301-705.5 ,Physics ,QC1-999 ,Chemistry ,QD1-999 - Abstract
Airborne fungal spores have always played an important role in the spread of fungal crop diseases, causing great concern. The traditional microscopic spore classification method mainly relies on naked eye observations and classification by professional and technical personnel in a laboratory. Due to the large number of spores captured, this method is labor-intensive, time-consuming, and inefficient, and sometimes leads to huge errors. Thus, an alternative method is required. In this study, a method was proposed to identify airborne disease spores from greenhouse crops using digital image processing. First, in an indoor simulation, images of airborne disease spores from three greenhouse crops were collected using portable volumetric spore traps. Then, a series of image preprocessing methods were used to identify the spores, including mean filtering, Gaussian filtering, OTSU (maximum between-class variance) method binarization, morphological operations, and mask operations. After image preprocessing, 90 features of the spores were extracted, including color, shape, and texture features. Based on these features, logistics regression (LR), K nearest neighbor (KNN), random forest (RF), and support vector machine (SVM) classification models were built. The test results showed that the average accuracy rates for the 3 classes of disease spores using the SVM model, LR model, KNN model, and RF model were 94.36%, 90.13%, 89.37%, and 89.23%, respectively. The harmonic average of the accuracy and the recall rate value (F value) were higher for the SVM model and its overall average value reached 91.68%, which was 2.03, 3.59, and 3.96 percentage points higher than the LR model, KNN model, and RF model, respectively. Therefore, this method can effectively identify 3 classes of diseases spores and this study can provide a reference for the identification of greenhouse disease spores.
- Published
- 2020
- Full Text
- View/download PDF
278. A Comparison of Classification Techniques to Predict Brain-Computer Interfaces Accuracy Using Classifier-Based Latency Estimation
- Author
-
Md Rakibul Mowla, Jesus D. Gonzalez-Morales, Jacob Rico-Martinez, Daniel A. Ulichnie, and David E. Thompson
- Subjects
brain-computer interfaces (BCI) ,classification methods ,P300 speller ,P3 latency estimation ,sparse autoencoders (SAE) ,Neurosciences. Biological psychiatry. Neuropsychiatry ,RC321-571 - Abstract
P300-based Brain-Computer Interface (BCI) performance is vulnerable to latency jitter. To investigate the role of latency jitter on BCI system performance, we proposed the classifier-based latency estimation (CBLE) method. In our previous study, CBLE was based on least-squares (LS) and stepwise linear discriminant analysis (SWLDA) classifiers. Here, we aim to extend the CBLE method using sparse autoencoders (SAE) to compare the SAE-based CBLE method with LS- and SWLDA-based CBLE. The newly-developed SAE-based CBLE and previously used methods are also applied to a newly-collected dataset to reduce the possibility of spurious correlations. Our results showed a significant (p<0.001) negative correlation between BCI accuracy and estimated latency jitter. Furthermore, we also examined the effect of the number of electrodes on each classification technique. Our results showed that on the whole, CBLE worked regardless of the classification method and electrode count; by contrast the effect of the number of electrodes on BCI performance was classifier dependent.
- Published
- 2020
- Full Text
- View/download PDF
279. Liquid Chromatographic Approach for the Discrimination and Classification of Cava Samples Based on the Phenolic Composition Using Chemometric Methods
- Author
-
Anaïs Izquierdo-Llopart and Javier Saurina
- Subjects
cava (sparkling wine) ,liquid chromatography ,phenolic acids ,principal component analysis ,classification methods ,partial least square-discriminant analysis ,Nutrition. Foods and food supply ,TX341-641 ,Nutritional diseases. Deficiency diseases ,RC620-627 - Abstract
Phenolic profiles obtained by liquid chromatography with UV/vis detection were here exploited to classify cava samples from the protected designation of origin Cava. Wine samples belonging to various classes which differed in grape varieties, blends and fermentation processes were studied based on profiling and fingerprinting approaches. Hence, concentrations of relevant phenolic acids and chromatograms registered at 310 nm were preliminarily examined by Principal Component Analysis (PCA) to extract information on cava classes. It was found that various hydroxybenzoic and hydroxycinnamic acids such as gallic, gentisic, caffeic or caftaric acids were up- or down-expressed depending on the wine varieties. Additionally, Partial Least Squares Discriminant Analysis (PLS-DA) was applied to classify the cava samples according to varietal origins and blends. The classification models were established using well-known wines as the calibration standards. Subsequently, models were applied to assign unknown samples to their corresponding classes. Excellent classification rates were obtained thus proving the potentiality of the proposed approach for characterization and authentication purposes.
- Published
- 2020
- Full Text
- View/download PDF
280. Continual learning classification method with single-label memory cells based on the intelligent mechanism of the biological immune system
- Author
-
Kun Qian, Shulin Liu, Dong Li, Xin Sun, Lanlan Gong, and Ming Gu
- Subjects
Statistics and Probability ,Computer science ,business.industry ,Mechanism (biology) ,General Engineering ,Continual learning ,Machine learning ,computer.software_genre ,ComputingMethodologies_PATTERNRECOGNITION ,Artificial Intelligence ,Classification methods ,Artificial intelligence ,Biological immune system ,business ,computer ,Single label - Abstract
The traditional batch learning classification methods need to obtain all kinds of data once before training. This makes them unable to recognize the data from the unseen types and cannot continuously enhance their classification ability through learning the testing data in the testing process, because they lack continual learning ability. Inspired by the continual learning mechanism of the biological immune system (BIS), this paper proposed a continual learning classification method with single-label memory cells (S-CLCM). The type of testing data is identified by memory cells, and the data type from unseen types is determined by an affinity threshold. New memory cells are cultivated continuously by learning the testing data to enhance the classification ability of S-CLCM gradually. Every memory cell has the same size and a unique type. It becomes a standard batch learning classification method or a standard clustering method under certain conditions. Take the experiments on twenty benchmark datasets to estimate its classification performance and possible superiority. Results show S-CLCM has good performance when it becomes a standard batch learning classification method, and S-CLCM is superior to the other classical classification algorithms when the data from unseen types or new labeled data appear during the testing process. It can improve the classification accuracy by up to 33%, and by at least 14%.
- Published
- 2022
- Full Text
- View/download PDF
281. Combined RF-Based Drone Detection and Classification
- Author
-
Sofie Pollin, Sreeraj Rajendran, Sanjoy Basak, and Bart Scheers
- Subjects
Artificial Intelligence ,Computer Networks and Communications ,Hardware and Architecture ,Computer science ,Real-time computing ,SIGNAL (programming language) ,Classification methods ,Detection performance ,Drug trafficking ,Radio frequency ,Residual neural network ,Drone - Abstract
Despite several beneficial applications, unfortunately, drones are also being used for illicit activities such as drug trafficking, firearm smuggling or to impose threats to security-sensitive places like airports and nuclear power plants. The existing drone localization and neutralization technologies work on the assumption that the drone has already been detected and classified. Although we have observed a tremendous advancement in the sensor industry in this decade, there is no robust drone detection and classification method proposed in the literature yet. This paper focuses on radio frequency (RF) based drone detection and classification using the frequency signature of the transmitted signal. We have created a novel drone RF dataset using commercial drones and presented a detailed comparison between a two-stage and combined detection and classification framework. The detection and classification performance of both frameworks are presented for a single-signal and simultaneous multi-signal scenario. With detailed analysis, we show that You Only Look Once (YOLO) framework provides better detection performance compared to the Goodness-of-Fit (GoF) spectrum sensing for a simultaneous multi-signal scenario and good classification performance comparable to Deep Residual Neural Network (DRNN) framework.
- Published
- 2022
- Full Text
- View/download PDF
282. Classification Methods and Identification of Reniform Nematode Resistance in Known Soybean Cyst Nematode-Resistant Soybean Genotypes
- Author
-
Paula Agudelo, Henry T. Nguyen, Tri D. Vuong, Robert T. Robbins, Mariola Usovsky, Devany Crippen, Vijay Shankar, and Juliet Fultz Wilkes
- Subjects
Veterinary medicine ,Genotype ,biology ,Resistance (ecology) ,Cysts ,Heterodera ,Soybean cyst nematode ,Plant Science ,biology.organism_classification ,Plant disease ,Nematode ,Animals ,Classification methods ,Identification (biology) ,Soybeans ,Tylenchoidea ,Agronomy and Crop Science ,Plant Diseases - Abstract
Plant parasitic nematodes are a major yield-limiting factor of soybean in the United States and Canada. It has been indicated that soybean cyst nematode (SCN; Heterodera glycines Ichinohe) and reniform nematode (RN; Rotylenchulus reniformis Linford and Oliveira) resistance could be genetically related. For many years, fragmentary data have shown this relationship. This report evaluates RN reproduction on 418 plant introductions (PIs) selected from the U.S. Department of Agriculture Soybean Germplasm Collection with reported SCN resistance. The germplasm was divided into two tests of 214 PIs reported as resistant and 204 PIs reported as moderately resistant to SCN. The defining and reporting of RN resistance changed several times in the last 30 years, causing inconsistencies in RN resistance classification among multiple experiments. Comparison of four RN resistance classification methods was performed: (i) ≤10% as compared with the susceptible check, (ii) using normalized reproduction index (RI) values, and using (iii) transformed data log10(x), and (iv) transformed data log10(x + 1) in an optimal univariate k-means clustering analysis. The method of transformed data log10(x) was selected as the most accurate for classification of RN resistance. Among 418 PIs with reported SCN resistance, the log10(x) method grouped 59 PIs (15%) as resistant and 130 PIs (31%) as moderately resistant to RN. Genotyping of a subset of the most resistant PIs to both nematode species revealed their strong correlation with rhg1-a allele. This research identified genotypes with resistance to two nematode species and potential new sources of RN resistance that could be valuable to breeders in developing resistant cultivars.
- Published
- 2022
- Full Text
- View/download PDF
283. SegNet-based first-break picking via seismic waveform classification directly from shot gathers with sparsely distributed traces
- Author
-
Tao Xie, Shangxu Wang, Jie Qi, Sanyi Yuan, and Yue Zhao
- Subjects
Data processing ,Computer science ,business.industry ,Shot (filmmaking) ,Energy Engineering and Power Technology ,Binary number ,Geology ,Pattern recognition ,Geotechnical Engineering and Engineering Geology ,First break picking ,Geophysics ,Fuel Technology ,Geochemistry and Petrology ,Waveform ,Classification methods ,Probability distribution ,Economic Geology ,Artificial intelligence ,business ,TRACE (psycholinguistics) - Abstract
Manually picking regularly and densely distributed first breaks (FBs) are critical for shallow velocity-model building in seismic data processing. However, it is time consuming. We employ the fully-convolutional SegNet to address this issue and present a fast automatic seismic waveform classification method to pick densely-sampled FBs directly from common-shot gathers with sparsely distributed traces. Through feeding a large number of representative shot gathers with missing traces and the corresponding binary labels segmented by manually interpreted fully-sampled FBs, we can obtain a well-trained SegNet model. When any unseen gather including the one with irregular trace spacing is inputted, the SegNet can output the probability distribution of different categories for waveform classification. Then FBs can be picked by locating the boundaries between one class on post-FBs data and the other on pre-FBs background. Two land datasets with each over 2000 shots are adopted to illustrate that one well-trained 25-layer SegNet can favorably classify waveform and further pick fully-sampled FBs verified by the manually-derived ones, even when the proportion of randomly missing traces reaches 50%, 21 traces are missing consecutively, or traces are missing regularly.
- Published
- 2022
- Full Text
- View/download PDF
284. Analisis Sentimen Tweet KRI Nanggala 402 di Twitter menggunakan Metode Naïve Bayes Classifier
- Author
-
Eri Mardiani, Muhammad Ariel Djamaludin, and Agung Triayudi
- Subjects
Data source ,Naive Bayes classifier ,business.industry ,Computer science ,Sentiment analysis ,Classification methods ,Social media ,Artificial intelligence ,Crawling ,business ,computer.software_genre ,computer ,Natural language processing - Abstract
Social media is one of the technological developments that has contributed greatly in making it easy for us to communicate and socialize, one of which is using Twitter social media. Twitter in this study is used as a data source to analyze tweets discussing KRI Nanggala 402. Analysis of KRI Nanggala 402 twitter sentiment is used to see the tendency of public responses to the sinking of the KRI Nanggala 402 submarine whether to give positive or negative opinions. This Sentiment analysis uses the Naïve Bayes Classifier method, which is a classification method. The first research stage is crawling, processing, classification, and evaluation. The classification stage is carried out after the processing phase, where the classification results tend to be positive or negative, using the Naïve Bayes Classifier method. The accuracy of the system in the Sentiment analysis of the KRI Nanggala 402 tweet is 73.00%.
- Published
- 2022
- Full Text
- View/download PDF
285. Massive IoT Malware Classification Method Using Binary Lifting
- Author
-
Hae-Seon Jeong and Jin Kwak
- Subjects
Computer science ,business.industry ,Binary number ,computer.software_genre ,Theoretical Computer Science ,Computational Theory and Mathematics ,Artificial Intelligence ,Classification methods ,Malware ,Data mining ,Internet of Things ,business ,computer ,Software - Published
- 2022
- Full Text
- View/download PDF
286. Fully Automated Classification Method for Crops Based on Spatiotemporal Deep-Learning Fusion Technology
- Author
-
Shuting Yang, Lingjia Gu, Xiaofeng Li, Tao Jiang, and Fang Gao
- Subjects
business.industry ,Computer science ,Active learning (machine learning) ,Deep learning ,Machine learning ,computer.software_genre ,Convolutional neural network ,Range (mathematics) ,Sliding window protocol ,Digital image processing ,General Earth and Planetary Sciences ,Classification methods ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,computer ,Selection (genetic algorithm) - Abstract
Accurate and timely crop mapping is essential for agricultural applications, and deep-learning methods have been applied on a range of remotely sensed data sources to classify crops. In this article, we develop a novel crop classification method based on spatiotemporal deep-learning fusion technology. However, for crop mapping, the selection and labeling of training samples is expensive and time consuming. Therefore, we propose a fully automated training-sample-selection method. First, we design the method according to image processing algorithms and the concept of a sliding window. Second, we develop the Geo-3D convolutional neural network (CNN) and Geo-Conv1D for crop classification using time-series Sentinel-2 imagery. Specifically, we integrate geographic information of crops into the structure of deep-learning networks. Finally, we apply an active learning strategy to integrate the classification advantages of Geo-3D CNN and Geo-Conv1D. Experiments conducted in Northeast China show that the proposed sampling method can reliably provide and label a large number of samples and achieve satisfactory results for different deep-learning networks. Based on the automatic selection and labeling of training samples, the crop classification method based on spatiotemporal deep-learning fusion technology can achieve the highest overall accuracy (OA) with approximately 92.50% as compared with Geo-Conv1D (91.89%) and Geo-3D CNN (91.27%) in the three study areas, indicating that the proposed method is effective and efficient in multi-temporal crop classification.
- Published
- 2022
- Full Text
- View/download PDF
287. Application of the Layered Algorithm in search of an airborne contaminant source
- Author
-
Miroslaw Szaban and Anna Wawrzynczak
- Subjects
Computer Networks and Communications ,Computer science ,Gaussian ,Function (mathematics) ,Cellular automaton ,Theoretical Computer Science ,Domain (software engineering) ,symbols.namesake ,Artificial Intelligence ,Hardware and Architecture ,Dispersion (optics) ,symbols ,Model simulation ,Classification methods ,Algorithm ,Software - Abstract
The paper presents a new method of optimization by the Layered Algorithm (LA). The proposed algorithm reduces the initial area to the sub-area containing the optimum. The proposed technique is based on the classification of the optimized function values (data sampled by sensors). The classification method uses a two-dimensional three-state Cellular Automata (CA). The CA classifies all area points ascribed to the CA cells based on their values. Specification of the categorization layers to the data gives a possibility to identify the different levels areas. Consequently, after analysis, a sub-area containing the optimum can be designated. In this paper, the proposed algorithm is applied to find the location of the airborne contaminant source by analyzing the concentration of released substances reported by mobile sensors distributed over the domain of interest. The Gaussian dispersion model simulation of the contaminant dispersion in the urbanized area is applied to generate the data used to verify the efficiency of the proposed Layered Algorithm. The LA successfully estimates the sub-area of the considered domain where the contamination source is located, taking to account data from sensors solely.
- Published
- 2022
- Full Text
- View/download PDF
288. A Novel Classification Method with Cubic Spline Interpolation
- Author
-
Husam Ali Abdulmohsin, Hala Bahjat Abdul Wahab, and Abdul Mohssen Jaber Abdul Hossen
- Subjects
Computational Theory and Mathematics ,Artificial Intelligence ,Computer science ,Classification methods ,Spline interpolation ,Algorithm ,Software ,Theoretical Computer Science - Published
- 2022
- Full Text
- View/download PDF
289. Multielement Classification of a Small Fragmented Planting Farm Using Hyperspectral Unmanned Aerial Vehicle Image
- Author
-
Feiyu Peng, Yong Xie, Zui Tao, Qiancheng Dai, and Shao Wen
- Subjects
Artificial neural network ,Computer science ,Feature (computer vision) ,Yangtze river ,Classification methods ,Hyperspectral imaging ,Electrical and Electronic Engineering ,Geotechnical Engineering and Engineering Geology ,Scale (map) ,Remote sensing ,Image (mathematics) - Abstract
Aiming at identifying cropland in the Yangtze River Delta, we used unmanned aerial vehicle (UAV) to obtain high spatial and spectral resolution (HSSR) remote sensing images of a small farm in the southern Jiangsu Province. After feature augmentation and compression, we used two 3D-CNN algorithms and the baseline neural network (Baseline-NN) algorithm to classify the UAV-HSSR images. The classification results showed that these three classification methods could achieve the fine scale classification of all elements in the study area, with an overall accuracy of 86.560%, 85.416%, and 94.926%, and Kappa coefficients of 0.846, 0.833, and 0.936, respectively. The findings of this study indicate that hyperspectral UAV images have significant potential in the classification tasks of highly fragmented small farms, although the salt and pepper phenomenon was observed in the results of the three classification methods.
- Published
- 2022
- Full Text
- View/download PDF
290. Entropy-based network traffic anomaly classification method resilient to deception
- Author
-
Slavko Gajin and Juma Ibrahim
- Subjects
General Computer Science ,Computer science ,media_common.quotation_subject ,Classification methods ,Statistical physics ,Deception ,Anomaly (physics) ,Entropy (energy dispersal) ,media_common - Abstract
Entropy-based network traffic anomaly detection techniques are attractive due to their simplicity and applicability in a real-time network environment. Even though flow data provide only a basic set of information about network communications, they are suitable for efficient entropy-based anomaly detection techniques. However, a recent work reported a serious weakness of the general entropy-based anomaly detection related to its susceptibility to deception by adding spoofed data that camouflage the anomaly. Moreover, techniques for further classification of the anomalies mostly rely on machine learning, which involves additional complexity. We address these issues by providing two novel approaches. Firstly, we propose an efficient protection mechanism against entropy deception, which is based on the analysis of changes in different entropy types, namely Shannon, R?nyi, and Tsallis entropies, and monitoring the number of distinct elements in a feature distribution as a new detection metric. The proposed approach makes the entropy techniques more reliable. Secondly, we have extended the existing entropy-based anomaly detection approach with the anomaly classification method. Based on a multivariate analysis of the entropy changes of multiple features as well as aggregation by complex feature combinations, entropy-based anomaly classification rules were proposed and successfully verified through experiments. Experimental results are provided to validate the feasibility of the proposed approach for practical implementation of efficient anomaly detection and classification method in the general real-life network environment.
- Published
- 2022
- Full Text
- View/download PDF
291. Developing a Methodology for the Formation of a System of Attributes of Pathological Vascular Changes in the Fundus
- Author
-
Adal’bievich Tatarkanov, Aslan, Khasanovich Lampezhev, Abas, Khalitovich Tekeev, Ruslan, Alekseevich Marenkov, Dmitry, and Mikhailovich Chervyakov, Leonid
- Subjects
vascular pathology diagnosis ,classification methods ,clustering methods ,discriminant analysis ,evidence-based medicine - Abstract
This research aimed to develop a methodology for extracting data from diagnostic images of the fundus blood vessels and methods for their high-precision evaluation, focused on ensuring the diagnostic process standardization, reducing the time of examination and its cost within the framework of evidence-based medicine. Timely and competent diagnosis plays an important role in obtaining an optimal result for treating vascular pathologies. This research evaluated the effectiveness of existing approaches to the analysis of the geometric diagnostic attributes of the fundus vascular system state reflected in the images, which are necessary for identifying pathological vascular changes. A technique was developed for the formation of an optimal system of geometric diagnostic attributes according to the criterion of separability. It was shown that the most effective method for solving this problem is the discriminant analysis method, which, in the presence of a strong connection between certain groups of attributes, makes it possible to decide whether it is expedient to use them and reduce the dimension of the attribute space. Reducing the dimension can significantly reduce the number of calculations. The results of using full-scale images of the fundus with active medical support confirmed the effectiveness of the developed technique.
- Published
- 2023
292. To find the best clustering method that can be used in data analysis system recommendation
- Author
-
Bilge, İhtiman Emre, Güvenoğlu, Erdal, and Maltepe Üniversitesi, Lisansüstü Eğitim Enstitüsü
- Subjects
Makine öğrenimi algoritmaları ,Machine learning ,Data analysis ,Classification methods ,Clustering methods ,Machine learning algorithms ,Veri analizi ,Kümeleme yöntemleri ,Sınıflandırma yöntemleri ,Makine öğrenimi - Abstract
Makine öğrenimi ile var olan veri ile 7 adet model üzerinden en yüksek skoru elde eden kümeleme yöntemi oluşturularak veriye göre en yüksek skor elde ettiğimiz algoritma mantığı geliştirilmiştir. Bu şekilde kısıtlı veri ile modeller ilişkilendirilip çıktıları analiz edilmiştir. Birden fazla mantık içeren modeller seçilerek, sınıflandırma mantığı ve benzeri veriler baz alınarak test puanı ile tahmin puanı arasında en başarılı algoritmalar oluşturulmuştur. Veri analizi 7 model kullanılarak yapılır ve puanlar üzerinden fonksiyonlar yardımıyla verilerinize göre kullanım amacına göre en iyi puan sonuçlarını vermektedir., With the existing data with machine learning, the clustering method that achieves the highest score over 7 models was created, and the algorithm logic, in which we obtained the highest score according to the data, was developed. In this way, models were associated with limited data and their outputs were analyzed. By choosing models containing more than one logic, the most successful algorithms were created between the test score and the prediction score on the basis of the classification logic and similar data. Data analysis is made using 7 models and it provides the best score results according to the purpose of use according to your data with the help of functions over the scores.
- Published
- 2023
293. Применение машинного обучения для определения порядка прилагательных в английском языке
- Subjects
обработка естественного языка ,методы классификации ,adjective ordering ,classification methods ,гиперонимы ,порядок прилагательных ,word vector representation ,hypernyms ,GloVe ,natural language processing ,векторное представление слов - Abstract
В статье рассматривается способ решения задачи упорядочивания прилагательных в предложении на английском языке путем определения их гиперонимов. Определение гиперонима можно свести к задаче классификации, поэтому в данной работе произведено сравнение наиболее популярных методов классификации в машинном обучении: метод поиска ближайших соседей, логистическая регрессия, классификатор дерева решений, метод опорных векторов и наивный байесовский метод. Модели были обучены на выборке, содержащей прилагательные и их гиперонимы. Для анализируемого прилагательного отбираются схожие уже классифицированные прилагательные из обучающей выборки и на основе этих данных определяется наиболее семантически подходящий гипероним. Информацию о схожести слов предлагается брать из готовых эмбеддингов GloVe. Используя технику gridsearch, были подобраны оптимальные значения гиперпараметров для метода поиска ближайших соседей K-Nearest Neighbors. С помощью метрик точности (precision), полноты (recall) и F1-меры было проанализировано качество классификации данных при использовании каждого из перечисленных выше методов. Так как готовых датасетов, состоящих из классифицированных прилагательных, на данный момент нет, то для измерений вручную было классифицировано 300 прилагательных., The article presents a methodology for solving the adjective ordering problem in English sentences by determining their hypernyms. The determining of a hypernym can be represented as a classification task; therefore, the most popular machine-learning classification methods were compared, they include the following: nearest neighbors method, logistic regression, decision classifier, support vector machine and naive Bayes method. The models were trained on a sample that contained adjectives and their hypernyms. For each adjective, similar adjectives from the training sample were selected; the most semantically appropriate hypernym was determined based on them. The use of information about word similarity from GloVe embeddings is proposed. The optimal values of hyperparameters for the K-Nearest Neighbors method were selected by means of the gridsearch technique. The quality of data classification was evaluated applying the metrics of precision, recall, and F1-measure for each of the methods. Since there were no ready-made datasets of classified adjectives, 300 adjectives were classified manually to create necessary samples., МОДЕЛИРОВАНИЕ, ОПТИМИЗАЦИЯ И ИНФОРМАЦИОННЫЕ ТЕХНОЛОГИИ, Выпуск 1 (40) 2023, Pages 28-29
- Published
- 2023
- Full Text
- View/download PDF
294. 200 years of naming dinosaurs: scientists call for overhaul of antiquated system.
- Author
-
Sanderson K
- Subjects
- Animals, History, 19th Century, History, 20th Century, History, 21st Century, Classification methods, Dinosaurs classification, Paleontology history, Paleontology methods, Paleontology trends
- Published
- 2024
- Full Text
- View/download PDF
295. SeqCode in the golden age of prokaryotic systematics.
- Author
-
Jiménez DJ and Rosado AS
- Subjects
- Classification methods, Genome, Bacterial, Phylogeny, Genome, Archaeal, Archaea classification, Archaea genetics, Bacteria classification, Bacteria genetics, Metagenomics methods, Terminology as Topic
- Abstract
The SeqCode is a new code of prokaryotic nomenclature that was developed to validate taxon names using genome sequences as the type material. The present article provides an independent view about the SeqCode, highlighting its history, current status, basic features, pros and cons, and use to date. We also discuss important topics to consider for validation of novel prokaryotic taxon names using genomes as the type material. Owing to significant advances in metagenomics and cultivation methods, hundreds of novel prokaryotic species are expected to be discovered in the coming years. This manuscript aims to stimulate and enrich the debate around the use of the SeqCode in the upcoming golden age of prokaryotic taxon discovery and systematics., (© The Author(s) 2024. Published by Oxford University Press on behalf of the International Society for Microbial Ecology.)
- Published
- 2024
- Full Text
- View/download PDF
296. Efficient phylogenetic tree inference for massive taxonomic datasets: harnessing the power of a server to analyze 1 million taxa.
- Author
-
Piñeiro C and Pichel JC
- Subjects
- Algorithms, Computational Biology methods, Classification methods, Databases, Genetic, Phylogeny, Software
- Abstract
Background: Phylogenies play a crucial role in biological research. Unfortunately, the search for the optimal phylogenetic tree incurs significant computational costs, and most of the existing state-of-the-art tools cannot deal with extremely large datasets in reasonable times., Results: In this work, we introduce the new VeryFastTree code (version 4.0), which is able to construct a tree on 1 server using single-precision arithmetic from a massive 1 million alignment dataset in only 36 hours, which is 3 times and 3.2 times faster than its previous version and FastTree-2, respectively. This new version further boosts performance by parallelizing all tree traversal operations during the tree construction process, including subtree pruning and regrafting moves. Additionally, it introduces significant new features such as support for new and compressed file formats, enhanced compatibility across a broader range of operating systems, and the integration of disk computing functionality. The latter feature is particularly advantageous for users without access to high-end servers, as it allows them to manage very large datasets, albeit with an increase in computing time., Conclusions: Experimental results establish VeryFastTree as the fastest tool in the state-of-the-art for maximum likelihood phylogeny estimation. It is publicly available at https://github.com/citiususc/veryfasttree. In addition, VeryFastTree is included as a package in Bioconda, MacPorts, and all Debian-based Linux distributions., (© The Author(s) 2024. Published by Oxford University Press GigaScience.)
- Published
- 2024
- Full Text
- View/download PDF
297. Species Diagnosis and DNA Taxonomy.
- Author
-
Ahrens D
- Subjects
- DNA Barcoding, Taxonomic methods, Classification methods, Phylogeny, Species Specificity, DNA genetics
- Abstract
The use of DNA has helped to improve and speed up species identification and delimitation. However, it also provides new challenges to taxonomists. Incongruence of outcome from various markers and delimitation methods, bias from sampling and skewed species distribution, implemented models, and the choice of methods/priors may mislead results and also may, in conclusion, increase elements of subjectivity in species taxonomy. The lack of direct diagnostic outcome from most contemporary molecular delimitation approaches and the need for a reference to existing and best sampled trait reference systems reveal the need for refining the criteria of species diagnosis and diagnosability in the current framework of nomenclature codes and good practices to avoid nomenclatorial instability, parallel taxonomies, and consequently more and new taxonomic impediment., (© 2024. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.)
- Published
- 2024
- Full Text
- View/download PDF
298. DNA Barcoding in Species Delimitation: From Genetic Distances to Integrative Taxonomy.
- Author
-
Miralles A, Puillandre N, and Vences M
- Subjects
- Classification methods, Phylogeny, Animals, Species Specificity, DNA Barcoding, Taxonomic methods
- Abstract
Over the past two decades, DNA barcoding has become the most popular exploration approach in molecular taxonomy, whether for identification, discovery, delimitation, or description of species. The present contribution focuses on the utility of DNA barcoding for taxonomic research activities related to species delimitation, emphasizing the following aspects:(1) To what extent DNA barcoding can be a valuable ally for fundamental taxonomic research, (2) its methodological and theoretical limitations, (3) the conceptual background and practical use of pairwise distances between DNA barcode sequences in taxonomy, and (4) the different ways in which DNA barcoding can be combined with complementary means of investigation within a broader integrative framework. In this chapter, we recall and discuss the key conceptual advances that have led to the so-called renaissance of taxonomy, elaborate a detailed glossary for the terms specific to this discipline (see Glossary in Chap. 35 ), and propose a newly designed step-by-step species delimitation protocol starting from DNA barcode data that includes steps from the preliminary elaboration of an optimal sampling strategy to the final decision-making process which potentially leads to nomenclatural changes., (© 2024. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.)
- Published
- 2024
- Full Text
- View/download PDF
299. Austenitic Stainless Steel EN 1.4404 Corrosion Detection Using Classification Techniques
- Author
-
Jiménez-Come, M. J., Muñoz, E., García, R., Matres, V., Martín, M. L., Trujillo, F., Turias, I., Kacprzyk, Janusz, editor, Corchado, Emilio, editor, Snášel, Václav, editor, Sedano, Javier, editor, Hassanien, Aboul Ella, editor, Calvo, José Luis, editor, and Ślȩzak, Dominik, editor
- Published
- 2011
- Full Text
- View/download PDF
300. A Robust Ensemble Classification Method Analysis
- Author
-
Zhang, Zhongwei, Li, Jiuyong, Hu, Hong, Zhou, Hong, and Arabnia, Hamid R., editor
- Published
- 2010
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.