194 results on '"Gozde Bozdagi Akar"'
Search Results
52. Temporal and spatial scaling for stereoscopic video compression.
- Author
-
Anil Aksay, Cagdas Bilen, Engin Kurutepe, Tanir Ozcelebi, Gozde Bozdagi Akar, M. Reha Civanlar, and A. Murat Tekalp
- Published
- 2006
53. Schemes for Multiple Description Coding of Stereoscopic Video.
- Author
-
Andrey Norkin, Anil Aksay, Cagdas Bilen, Gozde Bozdagi Akar, Atanas P. Gotchev, and Jaakko Astola
- Published
- 2006
- Full Text
- View/download PDF
54. A Comparison on Textured Motion Classification.
- Author
-
Kaan öztekin and Gozde Bozdagi Akar
- Published
- 2006
- Full Text
- View/download PDF
55. A Multi-View Video Codec Based on H.264.
- Author
-
Cagdas Bilen, Anil Aksay, and Gozde Bozdagi Akar
- Published
- 2006
- Full Text
- View/download PDF
56. Multiple Description Scalar Quantization Based 3D Mesh Coding.
- Author
-
M. Oguz Bici and Gozde Bozdagi Akar
- Published
- 2006
- Full Text
- View/download PDF
57. View point tracking for 3D display systems.
- Author
-
Yusuf Bediz and Gozde Bozdagi Akar
- Published
- 2005
58. Subjective evaluation of effects of spectral and spatial redundancy reduction on stereo images.
- Author
-
Anil Aksay, Cagdas Bilen, and Gozde Bozdagi Akar
- Published
- 2005
59. Evaluation of disparity map characteristics for stereo image coding.
- Author
-
Anil Aksay, M. Oguz Bici, and Gozde Bozdagi Akar
- Published
- 2005
- Full Text
- View/download PDF
60. GAYE: a face recognition system.
- Author
-
Burcu Kepenekci, Faik Boray Tek, Onur Cilingir, Ufuk Sakarya, and Gozde Bozdagi Akar
- Published
- 2004
- Full Text
- View/download PDF
61. Face Verification Competition on the XM2VTS Database.
- Author
-
Kieron Messer, Josef Kittler, Mohammad Sadeghi 0001, Sébastien Marcel, Christine Marcel, Samy Bengio, Fabien Cardinaux, Conrad Sanderson, Jacek Czyz, Luc Vandendorpe, Sanun Srisuk, Maria Petrou, Werasak Kurutach, Alexander Kadyrov, Roberto Paredes, Burcu Kepenekci, Faik Boray Tek, Gozde Bozdagi Akar, Farzin Deravi, and Nick Jeremy Mavity
- Published
- 2003
- Full Text
- View/download PDF
62. Occluded face recognition based on Gabor wavelets.
- Author
-
Burcu Kepenekci, Faik Boray Tek, and Gozde Bozdagi Akar
- Published
- 2002
- Full Text
- View/download PDF
63. Impact of scalability in video transmission in promotion-capable differentiated services networks.
- Author
-
Eren Gürses, Gozde Bozdagi Akar, and Nail Akar
- Published
- 2002
- Full Text
- View/download PDF
64. Simultaneous stereo-motion fusion and 3-D motion tracking.
- Author
-
Yücel Altunbasak, A. Murat Tekalp, and Gozde Bozdagi Akar
- Published
- 1995
- Full Text
- View/download PDF
65. SSIM Modelin Geliştirilmesine Dayanan Bir 3B Video Kalite Değerlendirme Metriği
- Author
-
Gökçe NUR YILMAZ and Gozde BOZDAGI AKAR
- Subjects
Engineering ,3B video,SSIM,video kalite değerlendirmesi ,Mühendislik - Abstract
Günümüzdeki en revaçta araştırma alanlarından birisi kullanıcılara gelişmiş çoklu-ortam servisleri sağlayabilmek adına 3 Boyutlu (3B) video Kalite Deneyimini (KD) etkin olarak tahmin eden objektif metriklerin geliştirilmesidir. Fakat, literatürde standartlaşmış ve yaygın kullanılan bir metrik henüz bulunmamaktadır. Bu yüzden, Structural SIMilarity Index (SSIM) gibi 2 Boyutlu (2B) video kalite ölçümünde sıklıkla kullanılan metrikler 3B video kalite ölçümünde de kullanılmaktadır. Ancak bu metrikler İnsan Görme Sitemini (İGS) etkileyen 3B video bağlantılı özellikleri içermedikleri için güvenilir 3B video kalite ölçümü sağlamaktan oldukça uzaktırlar. Bunları göz önüne alarak, bu çalışmada, SSIM, zıtlık, hareket ve yapısal bilgi karakteristikleri gibi İGS’yi etkileyen 3B video özellikleri ile geliştirilmiştir. Geliştirilen SSIM metriği kullanılarak elde edilen sonuçlar, bu metriğin gelişmiş çoklu-ortam servisleri sağlayabilmek açısından etkinliğini kanıtlamaktadır.
- Published
- 2021
66. 3D Video Quality Evaluation Based on SSIM Model Improvement
- Author
-
Gozde Bozdagi Akar and Gokce Nur Yilmaz
- Subjects
business.industry ,Computer science ,End user ,Structural similarity ,Research areas ,Video quality ,Machine learning ,computer.software_genre ,Feature (computer vision) ,Human visual system model ,Artificial intelligence ,business ,computer ,Reliability (statistics) - Abstract
In order to provide improved multimedia services to the end users, developing objective models efficiently predicting 3 Dimensional (3D) video Quality of Experience (QoE) can currently be considered as one of the most significant research areas. Nevertheless, there is currently no model standardized and widely utilized by the researchers due to its efficient and reliable assessment of the 3D video quality. Therefore, highly exploited 2 Dimensional (2D) video quality assessment models such as Structural SIMilarity Index (SSIM) are preferred for the 3D video quality evaluation. However, providing efficiency and reliability for the 3D video quality assessment using the 2D video quality assessment models can only be ensured if they include 3D video related features effecting Human Visual System (HVS). Under the light of these information, the SSIM model is improved for the 3D video quality assessment using perceptually significant feature, contrast and motion characteristics having impact on the HVS in this study. The results obtained by utilizing the improved SSIM model clearly present that the model is quite competent to provide enhanced multimedia services to the end users.
- Published
- 2021
67. A study on the effect of MPE-FEC for 3D video broadcasting over DVB-H.
- Author
-
Anil Aksay, M. Oguz Bici, Döne Bugdayci, Antti Tikänmaki, Atanas P. Gotchev, and Gozde Bozdagi Akar
- Published
- 2009
- Full Text
- View/download PDF
68. Demo paper: Real time 3D video streaming: A mobile approach.
- Author
-
Emin Zerman and Gozde Bozdagi Akar
- Published
- 2013
- Full Text
- View/download PDF
69. CHAOS Challenge - Combined (CT-MR) Healthy Abdominal Organ Segmentation
- Author
-
Savas Ozkan, N. Sinem Gezer, Dmitrii Lachinov, Debdoot Sheet, Fabian Isensee, Gozde Bozdagi Akar, M. Alper Selver, Soumick Chatterjee, Oliver Speck, A. Emre Kavur, Sinem Aslan, Josef Pauli, Oğuz Dicle, Gozde Unal, Pierre-Henri Conze, Andreas Nürnberger, Klaus H. Maier-Hein, Gurbandurdy Dovletov, Ronnie Rajan, Vladimir Groza, Rachana Sathish, Bora Baydar, Matthias Perkonigg, Shuo Han, Philipp Ernst, Duc Duy Pham, Mustafa Baris, Dokuz Eylül Üniversitesi = Dokuz Eylül University [Izmir] (DEÜ), University of Ca’ Foscari [Venice, Italy], Département lmage et Traitement Information (IMT Atlantique - ITI), IMT Atlantique Bretagne-Pays de la Loire (IMT Atlantique), Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT), Laboratoire de Traitement de l'Information Medicale (LaTIM), Université de Brest (UBO)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre Hospitalier Régional Universitaire de Brest (CHRU Brest)-IMT Atlantique Bretagne-Pays de la Loire (IMT Atlantique), Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)-Institut Brestois Santé Agro Matière (IBSAM), Université de Brest (UBO), MEDIAN Technologies, University of Duisburg-Essen, Otto-von-Guericke University [Magdeburg] (OVGU), Middle East Technical University [Ankara] (METU), Medizinische Universität Wien = Medical University of Vienna, Johns Hopkins University (JHU), German Cancer Research Center - Deutsches Krebsforschungszentrum [Heidelberg] (DKFZ), Department of Biomedical Imaging and Image-guided Therapy [Medical University of Vienna], Indian Institute of Technology Kharagpur (IIT Kharagpur), and Istanbul Technical University (ITÜ)
- Subjects
FOS: Computer and information sciences ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition ,Health Informatics ,Machine learning ,computer.software_genre ,Field (computer science) ,030218 nuclear medicine & medical imaging ,03 medical and health sciences ,0302 clinical medicine ,Segmentation ,Maschinenbau ,Abdomen ,Cross-modality ,FOS: Electrical engineering, electronic engineering, information engineering ,[INFO.INFO-IM]Computer Science [cs]/Medical Imaging ,Medical imaging ,Humans ,Radiology, Nuclear Medicine and imaging ,Challenge ,Set (psychology) ,Modality (human–computer interaction) ,Radiological and Ultrasound Technology ,Settore INF/01 - Informatica ,business.industry ,Deep learning ,Image and Video Processing (eess.IV) ,Electrical Engineering and Systems Science - Image and Video Processing ,Liver Segmentation ,Computer Graphics and Computer-Aided Design ,3. Good health ,CHAOS (operating system) ,Surface distance ,Informatik ,Liver ,[INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV] ,Computer Vision and Pattern Recognition ,Artificial intelligence ,Tomography, X-Ray Computed ,business ,Settore ING-INF/05 - Sistemi di Elaborazione delle Informazioni ,[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing ,computer ,Algorithms ,030217 neurology & neurosurgery - Abstract
Segmentation of abdominal organs has been a comprehensive, yet unresolved, research field for many years. In the last decade, intensive developments in deep learning (DL) introduced new state-of-the-art segmentation systems. Despite outperforming the overall accuracy of existing systems, the effects of DL model properties and parameters on the performance are hard to interpret. This makes comparative analysis a necessary tool towards interpretable studies and systems. Moreover, the performance of DL for emerging learning approaches such as cross-modality and multi-modal semantic segmentation tasks has been rarely discussed. In order to expand the knowledge on these topics, the CHAOS – Combined (CT-MR) Healthy Abdominal Organ Segmentation challenge was organized in conjunction with the IEEE International Symposium on Biomedical Imaging (ISBI), 2019, in Venice, Italy. Abdominal organ segmentation from routine acquisitions plays an important role in several clinical applications, such as pre-surgical planning or morphological and volumetric follow-ups for various diseases. These applications require a certain level of performance on a diverse set of metrics such as maximum symmetric surface distance (MSSD) to determine surgical error-margin or overlap errors for tracking size and shape differences. Previous abdomen related challenges are mainly focused on tumor/lesion detection and/or classification with a single modality. Conversely, CHAOS provides both abdominal CT and MR data from healthy subjects for single and multiple abdominal organ segmentation. Five different but complementary tasks were designed to analyze the capabilities of participating approaches from multiple perspectives. The results were investigated thoroughly, compared with manual annotations and interactive methods. The analysis shows that the performance of DL models for single modality (CT / MR) can show reliable volumetric analysis performance (DICE: 0.98 ± 0.00 / 0.95 ± 0.01), but the best MSSD performance remains limited (21.89 ± 13.94 / 20.85 ± 10.63 mm). The performances of participating models decrease dramatically for cross-modality tasks both for the liver (DICE: 0.88 ± 0.15 MSSD: 36.33 ± 21.97 mm). Despite contrary examples on different applications, multi-tasking DL models designed to segment all organs are observed to perform worse compared to organ-specific ones (performance drop around 5%). Nevertheless, some of the successful models show better performance with their multi-organ versions. We conclude that the exploration of those pros and cons in both single vs multi-organ and cross-modality segmentations is poised to have an impact on further research for developing effective algorithms that would support real-world clinical applications. Finally, having more than 1500 participants and receiving more than 550 submissions, another important contribution of this study is the analysis on shortcomings of challenge organizations such as the effects of multiple submissions and peeking phenomenon. © 2020 Elsevier B.V., 116E133, BIDEB-2214 College of Environmental Science and Forestry, State University of New York, ESF: 1059B191701102, BIDEB-2219, ZS/2016/08/80646 Türkiye Bilimsel ve Teknolojik Araştirma Kurumu, TÜBITAK, The organizers would like to thank Ivana Isgum and Tom Vercauteren in the challenge committee of ISBI 2019 for their guidance and support. We express our gratitude to supporting organizations of the grand-challenge.org platform. We thank Esranur Kazaz, Umut Baran Ekinci, Ece K?se, Fabian Isensee, David V?lgyes, and Javier Coronel for their contributions. Last but not least, our special thanks go to Ludmila I. Kuncheva for her valuable contributions. This work is supported by Scientific and Technological Research Council of Turkey (TUBITAK) ARDEB-EEEAG under grant number 116E133 and TUBITAK BIDEB-2214 International Doctoral Research Fellowship Programme. The work of P. Ernst, S. Chatterjee, O. Speck and, A. N?rnberger was conducted within the context of the International Graduate School MEMoRIAL at OvGU Magdeburg, Germany, supported by ESF (project no. ZS/2016/08/80646). The work of S. Aslan within the context of Ca? Foscari University of Venice is supported by under TUBITAK BIDEB-2219 grant no 1059B191701102., This work is supported by Scientific and Technological Research Council of Turkey (TUBITAK) ARDEB-EEEAG under grant number 116E133 and TUBITAK BIDEB-2214 International Doctoral Research Fellowship Programme. The work of P. Ernst, S. Chatterjee, O. Speck and, A. Nürnberger was conducted within the context of the International Graduate School MEMoRIAL at OvGU Magdeburg, Germany, supported by ESF (project no. ZS/2016/08/80646). The work of S. Aslan within the context of Ca’ Foscari University of Venice is supported by under TUBITAK BIDEB-2219 grant no 1059B191701102.
- Published
- 2021
70. Dental X-ray Image Segmentation using Octave Convolution Neural Network
- Author
-
Gozde Bozdagi Akar and Mete Can Kaya
- Subjects
Artificial neural network ,business.industry ,Computer science ,Image segmentation ,Convolutional neural network ,Object detection ,030218 nuclear medicine & medical imaging ,Convolution ,03 medical and health sciences ,0302 clinical medicine ,X ray image ,Octave ,Computer vision ,Segmentation ,Artificial intelligence ,business ,030217 neurology & neurosurgery - Abstract
In this paper, we present a Unet architecture made of octave convolution for dental image segmentation problem. In this architecture, the requirements for memory and accuracy are significantly improved compared to previous works in the literature. Compare to state-of-art models on this topic the classification accuracy in dental image segmentation is increased by %2, and the memory usage is decreased by %70. Suggested architecture showed a performance of success on ISBI2015 dataset.
- Published
- 2020
71. Comparison Of Semi-Automatic And Deep Learning-Based Automatic Methods For Liver Segmentation In Living Liver Transplant Donors
- Author
-
Savas Ozkan, Mustafa Baris, Gozde Bozdagi Akar, Çağlar Kılıkçıer, Ulaş Yüksel, M. Alper Selver, Naciye Sinem Gezer, Sahin Olut, Bora Baydar, Oğuz Dicle, A. Emre Kavur, Gozde Unal, Yusuf Huseyin Sahin, Bursa Uludağ Üniversitesi/Mühendislik Fakültesi/Elektronik Mühendisliği., Kılıkçıer, Çağlar, and AAH-3031-2021
- Subjects
Reproducibility of results ,Male ,Scoring system ,Initialization ,Procedures ,030218 nuclear medicine & medical imaging ,Radiology, nuclear medicine & medical imaging ,0302 clinical medicine ,Image Processing, Computer-Assisted ,Medicine ,Segmentation ,Abdominal Imaging ,Fast marching method ,Accuracy ,Ground truth ,Living donor ,Repeatability ,Multilevel ,Reproducibility ,Liver ,Region growing ,Radiologist ,Diagnostic imaging ,Female ,Organ size ,Segmentation algorithm ,Cardiology and Cardiovascular Medicine ,Convolutional neural-networks ,CNN ,Anatomy and histology ,Abdominal organs ,MRI ,Human ,Adult ,Contrast enhancement ,Clinical article ,Image processing ,Article ,03 medical and health sciences ,Computer assisted tomography ,Plant seed ,Humans ,Radiology, Nuclear Medicine and imaging ,Computer-assisted ,Living donors ,Liver graft ,X-ray computed tomography ,Liver transplantation ,business.industry ,Volume ,CT Image ,Dice ,Deep learning ,Pattern recognition ,Watershed ,Liver weight ,Artificial intelligence ,Comparative study ,Tomography, X-Ray computed ,business ,Controlled study ,Qualitative analysis ,Model - Abstract
Purpose To compare the accuracy and repeatability of emerging machine learning based (i.e. deep) automatic segmentation algorithms with those of well-established semi-automatic (interactive) methods for determining liver volume in living liver transplant donors at computerized tomography (CT) imaging. Methods A total of 12 (6 semi-, 6 full-automatic) methods are evaluated. The semi-automatic segmentation algorithms are based on both traditional iterative models including watershed, fast marching, region growing, active contours and modern techniques including robust statistical segmenter and super-pixels. These methods entail some sort of interaction mechanism such as placing initialization seeds on images or determining a parameter range. The automatic methods are based on deep learning and they include three framework templates (DeepMedic, NiftyNet and U-Net) the first two of which are applied with default parameter sets and the last two involve adapted novel model designs. For 20 living donors (6 training and 12 test datasets), a group of imaging scientists and radiologists created ground truths by performing manual segmentations on contrast material-enhanced CT images. Each segmentation is evaluated using five metrics (i.e. volume overlap and relative volume errors, average/RMS/maximum symmetrical surface distances). The results are mapped to a scoring system and a final grade is calculated by taking their average. Accuracy and repeatability were evaluated using slice by slice comparisons and volumetric analysis. Diversity and complementarity are observed through heatmaps. Majority voting and Simultaneous Truth and Performance Level Estimation (STAPLE) algorithms are utilized to obtain the fusion of the individual results. Results The top four methods are determined to be automatic deep models having 79.63, 79.46 and 77.15 and 74.50 scores. Intra-user score is determined as 95.14. Overall, deep automatic segmentation outperformed interactive techniques on all metrics. The mean volume of liver of ground truth is found to be 1409.93 mL ± 271.28 mL, while it is calculated as 1342.21 mL ± 231.24 mL using automatic and 1201.26 mL ± 258.13 mL using interactive methods, showing higher accuracy and less variation on behalf of automatic methods. The qualitative analysis of segmentation results showed significant diversity and complementarity enabling the idea of using ensembles to obtain superior results. The fusion of automatic methods reached 83.87 with majority voting and 86.20 using STAPLE that are only slightly less than fusion of all methods that achieved 86.70 (majority voting) and 88.74 (STAPLE). Conclusion Use of the new deep learning based automatic segmentation algorithms substantially increases the accuracy and repeatability for segmentation and volumetric measurements of liver. Fusion of automatic methods based on ensemble approaches exhibits best results almost without any additional time cost due to potential parallel execution of multiple models.
- Published
- 2020
72. Multiple description coding of 3D dynamic meshes based on temporal subsampling.
- Author
-
M. Oguz Bici and Gozde Bozdagi Akar
- Published
- 2010
- Full Text
- View/download PDF
73. Wavelet-based multiple description coding of 3-D geometry.
- Author
-
Andrey Norkin, M. Oguz Bici, Gozde Bozdagi Akar, Atanas P. Gotchev, and Jaakko Astola
- Published
- 2007
- Full Text
- View/download PDF
74. Optimal packet scheduling and rate control for video streaming.
- Author
-
Eren Gürses, Gozde Bozdagi Akar, and Nail Akar
- Published
- 2007
- Full Text
- View/download PDF
75. Two-Way/Hybrid Clustering Architecture for Peer to Peer Systems.
- Author
-
Kasim Oztoprak and Gozde Bozdagi Akar
- Published
- 2007
- Full Text
- View/download PDF
76. Effect of Visual Context Information for Super Resolution Problems
- Author
-
Gozde Bozdagi Akar, Kadircan Becek, Baran Cengiz, Savas Ozkan, and Ekin Aykut
- Subjects
Interpretation (logic) ,business.industry ,Computer science ,Deep learning ,Context (language use) ,02 engineering and technology ,Machine learning ,computer.software_genre ,Superresolution ,Convolutional neural network ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer - Abstract
In this study, the effect of visual context information to the performance of learning-based techniques for the super resolution problem is analyzed. Beside the interpretation of the experimental results in detail, its theoretical reasoning is also achieved in the paper. For the experiments, two different visual datasets composed of natural and remote sensing scenes are utilized. From the experimental results, we observe that keeping visual context information in the course of parameter learning for convolutional neural networks yields better performance compared to the baselines. Moreover, we summarize that finetuning pre-trained parameters with the related context yet fewer samples improves the results.
- Published
- 2019
77. Convolutional Neural Networks Analyzed via Inverse Problem Theory and Sparse Representations
- Author
-
Gozde Bozdagi Akar and Cem Tarhan
- Subjects
FOS: Computer and information sciences ,Deblurring ,Computer Science - Machine Learning ,Mutual coherence ,Computer science ,Computer Science::Neural and Evolutionary Computation ,Machine Learning (stat.ML) ,020206 networking & telecommunications ,02 engineering and technology ,Inverse problem ,Residual ,Convolutional neural network ,Machine Learning (cs.LG) ,Set (abstract data type) ,Statistics - Machine Learning ,Computer Science::Computer Vision and Pattern Recognition ,Signal Processing ,Convergence (routing) ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Electrical and Electronic Engineering ,Algorithm ,Image restoration - Abstract
Inverse problems in imaging such as denoising, deblurring, superresolution (SR) have been addressed for many decades. In recent years, convolutional neural networks (CNNs) have been widely used for many inverse problem areas. Although their indisputable success, CNNs are not mathematically validated as to how and what they learn. In this paper, we prove that during training, CNN elements solve for inverse problems which are optimum solutions stored as CNN neuron filters. We discuss the necessity of mutual coherence between CNN layer elements in order for a network to converge to the optimum solution. We prove that required mutual coherence can be provided by the usage of residual learning and skip connections. We have set rules over training sets and depth of networks for better convergence, i.e. performance., PostPrint IET Signal Processing Journal
- Published
- 2018
78. A MAP-Based Approach for Hyperspectral Imagery Super-resolution
- Author
-
Seniha Esen Yuksel, Gozde Bozdagi Akar, Hasan Irmak, and OpenMETU
- Subjects
Markov random field ,Computer science ,0211 other engineering and technologies ,Hyperspectral imaging ,02 engineering and technology ,Iterative reconstruction ,Computer Graphics and Computer-Aided Design ,Superresolution ,0202 electrical engineering, electronic engineering, information engineering ,Maximum a posteriori estimation ,020201 artificial intelligence & image processing ,Image resolution ,Algorithm ,Software ,021101 geological & geomatics engineering - Abstract
In this study, we propose a novel single image Bayesian super-resolution (SR) algorithm where the hyperspectral image (HSI) is the only source of information. The main contribution of the proposed approach is to convert the ill-posed SR reconstruction (SRR) problem in the spectral domain to a quadratic optimization problem in the abundance map domain. In order to do so, Markov Random Field (MRF) based energy minimization approach is proposed and proved that the solution is quadratic. The proposed approach consists of five main steps. First, the number of endmembers in the scene is determined using virtual dimensionality. Second, the endmembers and their low resolution abundance maps are computed using simplex identification via the splitted augmented Lagrangian (SISAL) and fully constrained least squares (FCLS) algorithms. Third, high resolution (HR) abundance maps are obtained using our proposed maximum a posteriori (MAP) based energy function. This energy function is minimized subject to smoothness, unity and boundary constraints. Fourth, the HR abundance maps are further enhanced with texture preserving methods. Finally, HR HSI is reconstructed using the extracted endmembers and the enhanced abundance maps. The proposed method is tested on three real HSI datasets; namely the Cave, Harvard and Hyperspectral Remote Sensing Scenes (HRSS) and compared to state-of-the-art alternative methods using peak signal to noise ratio, structural similarity, spectral angle mapper and relative dimensionless global error in synthesis metrics. It is shown that the proposed method outperforms the state of the art methods in terms of quality while preserving the spectral consistency.
- Published
- 2018
79. Automatic color accuracy tests for camera performance comparison
- Author
-
Gozde Bozdagi Akar and Alican Hasarpa
- Subjects
Measure (data warehouse) ,Color constancy ,Scope (project management) ,business.industry ,Computer science ,Performance comparison ,Histogram ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Computer vision ,Artificial intelligence ,Variation (game tree) ,business ,Test (assessment) - Abstract
There are numerous criteria which are being used to measure camera performance and for determining such criteria, different tests are applied in different test environments. Within this framework, color accuracy testing at camera performance is one of foremost of such tests. In the scope of this paper, a method has been proposed to reduce user interaction in the color accuracy tests in the literature. At the same time, with the color constancy concept, it has been shown that color variation between the different test setups should also be considered as an important criterion on the camera performance.
- Published
- 2018
80. A comparison of inpainting techniques in image reanimation
- Author
-
Savas Ozkan, Ece Selin Boncu, and Gozde Bozdagi Akar
- Subjects
Computer science ,business.industry ,Perspective (graphical) ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Inpainting ,020206 networking & telecommunications ,02 engineering and technology ,Coherence (statistics) ,Iterative reconstruction ,Object (computer science) ,Image (mathematics) ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Computer vision ,Artificial intelligence ,Graphics ,business ,Texture synthesis - Abstract
Inpainting applications include object removal on images and videos, crack filling, error concealment, texture synthesis, where in this paper, its usage for image coherence and perspective emphasis on video frames in 2D image-to-video conversion system is analysed. Besides, the performance of different techniques in object removal and image reconstruction is compared using visual experiments and quality metrics.
- Published
- 2018
81. A GPR-based landmine identification method using energy and dielectric features
- Author
-
Alper Genc and Gozde Bozdagi Akar
- Subjects
010504 meteorology & atmospheric sciences ,Computer science ,business.industry ,Feature extraction ,0211 other engineering and technologies ,Pattern recognition ,02 engineering and technology ,01 natural sciences ,Support vector machine ,Ground-penetrating radar ,Wave impedance ,False alarm ,Artificial intelligence ,business ,Classifier (UML) ,021101 geological & geomatics engineering ,0105 earth and related environmental sciences - Abstract
This study presents a novel landmine identification method that estimates intrinsic parameters of buried objects from their primary and secondary GPR reflections to reduce false alarm rates of GPR-based landmine detection algorithms. To achieve this, two different features are extracted from A-scan GPR data of buried objects. The first feature identifies significant GPR signal length. The second feature estimates intrinsic impedance of the object. These two features are classified with support vector machine (SVM) classifier. The experimental results show that the proposed features have very high discrimination power which reduces false alarm rates to a great extent.
- Published
- 2018
82. Exploiting Local Indexing and Deep Feature Confidence Scores for Fast Image-to-Video Search
- Author
-
Gozde Bozdagi Akar and Savas Ozkan
- Subjects
FOS: Computer and information sciences ,Orientation (computer vision) ,business.industry ,Computer science ,Computer Vision and Pattern Recognition (cs.CV) ,Search engine indexing ,Semantic search ,Computer Science - Computer Vision and Pattern Recognition ,020207 software engineering ,Pattern recognition ,02 engineering and technology ,Visualization ,Feature (computer vision) ,Pattern recognition (psychology) ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Affine transformation ,Artificial intelligence ,Representation (mathematics) ,business - Abstract
The cost-effective visual representation and fast query-by-example search are two challenging goals that should be maintained for web-scale visual retrieval tasks on moderate hardware. This paper introduces a fast and robust method that ensures both of these goals by obtaining state-of-the-art performance for an image-to-video search scenario. Hence, we present critical enhancements to well-known indexing and visual representation techniques by promoting faster, better and moderate retrieval performance. We also boost the superiority of our method for some visual challenges by exploiting individual decisions of local and global descriptors at query time. For instance, local content descriptors represent copied/duplicated scenes with large geometric deformations such as scale, orientation and affine transformation. In contrast, the use of global content descriptors is more practical for near-duplicate and semantic searches. Experiments are conducted on a large-scale Stanford I2V dataset. The experimental results show that our method is useful in terms of complexity and query processing time for large-scale visual retrieval scenarios, even if local and global representations are used together. The proposed method is superior and achieves state-of-the-art performance based on the mean average precision (MAP) score of this dataset. Lastly, we report additional MAP scores after updating the ground annotations unveiled by retrieval results of the proposed method, and it shows that the actual performance., Comment: ICPR 2020
- Published
- 2018
- Full Text
- View/download PDF
83. Atmospheric Effects Removal for the Infrared Image Sequences
- Author
-
Seckin Ozsarac and Gozde Bozdagi Akar
- Subjects
Pixel ,business.industry ,Atmospheric correction ,Hyperspectral imaging ,Atmospheric model ,Noise-equivalent temperature ,Optics ,Computer Science::Computer Vision and Pattern Recognition ,Infrared window ,Radiance ,General Earth and Planetary Sciences ,Electrical and Electronic Engineering ,business ,Physics::Atmospheric and Oceanic Physics ,Optical depth ,Remote sensing - Abstract
Accurate correction of atmospheric effects on data captured by an infrared (IR) camera is crucial for several applications such as vegetation monitoring, temperature monitoring, satellite images, hyperspectral imaging, numerical model simulations, surface properties characterization, and IR measurement interpretation. Atmospheric effects depend on the temporal changes, i. e., year, season, day, hour, etc., and on the geometry between the camera and the measured scene, i. e., line of sight. The orientation and the optical depth of the camera significantly affect the variation of the geometry across the pixels. In this paper, we propose a method to estimate the range and zenith angle of each pixel using only the Global Positioning System (GPS) coordinates of the camera and a point of interest in the scene. The estimated geometry and measured meteorological data are used to obtain the spectral atmospheric transmittance and path radiance. Furthermore, we propose an atmospheric effects removal, i. e., atmospheric correction, method that considers the spectral characteristics of the detector, lens, and filter. The proposed atmospheric correction process is analyzed in detail with the simultaneous measurements of two IR cameras. In this process, an enhanced temperature calibration method is developed and it is shown that the temperature accuracy for the dynamic range of the IR camera is very close to the noise equivalent temperature difference (NETD) value of the camera.
- Published
- 2015
84. 2D high-frequency forward-looking sonar simulator based on continuous surfaces approach
- Author
-
Gozde Bozdagi Akar, Mehmet Kemal Leblebicioğlu, and Hakan Saç
- Subjects
Image formation ,Engineering ,General Computer Science ,business.industry ,Computation ,Process (computing) ,Sonar ,Image (mathematics) ,Forward looking ,Synthetic aperture sonar ,Computer vision ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,Simulation - Abstract
Optical cameras give detailed images in clear waters. However, in dark or turbid waters, information coming from electro-optical sensors is insufficient for accurate scene perception. Imaging sonars, also known as acoustic cameras, can provide enhanced target details in these scenarios. In this paper, a computationally efficient 2D high-frequency, forward-looking sonar image simulator is presented. Stages and requirements of the image formation process are explained in detail. For the postprocessing of the returned sonar signals, a novel computation engine is proposed based on the geometric structures of the simulated surfaces. By treating all the continuous surfaces separately, the simulator is able to exactly calculate bright and shadowed zones in the 2D sonar image.
- Published
- 2015
85. EndNet: Sparse AutoEncoder Network for Endmember Extraction and Hyperspectral Unmixing
- Author
-
Berk Kaya, Savas Ozkan, and Gozde Bozdagi Akar
- Subjects
FOS: Computer and information sciences ,Endmember ,Spectral signature ,Remote sensing application ,business.industry ,Computer Vision and Pattern Recognition (cs.CV) ,0211 other engineering and technologies ,Computer Science - Computer Vision and Pattern Recognition ,Hyperspectral imaging ,Pattern recognition ,02 engineering and technology ,Autoencoder ,Statistics::Machine Learning ,Metric (mathematics) ,General Earth and Planetary Sciences ,Artificial intelligence ,Electrical and Electronic Engineering ,Divergence (statistics) ,Projection (set theory) ,business ,021101 geological & geomatics engineering - Abstract
Data acquired from multi-channel sensors is a highly valuable asset to interpret the environment for a variety of remote sensing applications. However, low spatial resolution is a critical limitation for previous sensors and the constituent materials of a scene can be mixed in different fractions due to their spatial interactions. Spectral unmixing is a technique that allows us to obtain the material spectral signatures and their fractions from hyperspectral data. In this paper, we propose a novel endmember extraction and hyperspectral unmixing scheme, so called \textit{EndNet}, that is based on a two-staged autoencoder network. This well-known structure is completely enhanced and restructured by introducing additional layers and a projection metric (i.e., spectral angle distance (SAD) instead of inner product) to achieve an optimum solution. Moreover, we present a novel loss function that is composed of a Kullback-Leibler divergence term with SAD similarity and additional penalty terms to improve the sparsity of the estimates. These modifications enable us to set the common properties of endmembers such as non-linearity and sparsity for autoencoder networks. Lastly, due to the stochastic-gradient based approach, the method is scalable for large-scale data and it can be accelerated on Graphical Processing Units (GPUs). To demonstrate the superiority of our proposed method, we conduct extensive experiments on several well-known datasets. The results confirm that the proposed method considerably improves the performance compared to the state-of-the-art techniques in literature., To appear in IEEE Transaction on Geoscience and Remote Sensing
- Published
- 2017
86. Comparative analysis of hyperspectral feature extraction methods in vegetation classification
- Author
-
Gozde Bozdagi Akar, Mertalp Ocal, and Kazim Ergun
- Subjects
Discrete wavelet transform ,010504 meteorology & atmospheric sciences ,Computer science ,business.industry ,Vegetation classification ,Feature extraction ,0211 other engineering and technologies ,Hyperspectral imaging ,Pattern recognition ,02 engineering and technology ,01 natural sciences ,Principal component analysis ,Artificial intelligence ,business ,Classifier (UML) ,021101 geological & geomatics engineering ,0105 earth and related environmental sciences - Abstract
To perform an accurate vegetation classification in hyperspectral data, feature extraction process prior to classification is very important. Success rates of classifiers in vegetation are rather limited compared to classification of other types of materials. Therefore, it is required to perform an effective feature extraction before classification. Principle Component Analysis(PCA) is a common and easily applicable method for this purpose. However, PCA is not an optimal method for distinguishing between different plant species. In this study, the reasons for PCA not being an adequate method for this purpose are discussed and alternative useful feature extraction methods in classification of plant species are examined. Tests were performed for Spectrally Segmented PCA(SSPCA), Discrete Wavelet Transform(DWT) and Genetic Algorithm(GA) feature extraction methods, their effects on classifier performances were compared and it was observed that all of the mentioned alternatives had noticable improvements in classification performances.
- Published
- 2017
87. Radiometric features for vehicle classification with infrared images
- Author
-
Seckin Ozsarac and Gozde Bozdagi Akar
- Subjects
Infrared ,Computer science ,business.industry ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Process (computing) ,Context (language use) ,02 engineering and technology ,Object (computer science) ,01 natural sciences ,Class (biology) ,010309 optics ,0103 physical sciences ,0202 electrical engineering, electronic engineering, information engineering ,Radiance ,Radiometry ,020201 artificial intelligence & image processing ,Computer vision ,Artificial intelligence ,business ,Remote sensing - Abstract
A vehicle classification system, which uses features based on radiometry, is developed for single band infrared (IR) image sequences. In this context, the process is divided into three components. These are moving vehicle detection, radiance estimation, and classification. The major contribution of this paper lies in the usage of the radiance values as features, other than the raw output of IR camera output, to improve the classification performance of the detected objects. The motivation behind this is that each vehicle class has a discriminating radiance value that originates from the source temperature of the object modified by the intrinsic characteristics of the radiating surface and the environment. As a consequence, significant performance gains are achieved due to the use of radiance values in classification for the utilized measurement system.
- Published
- 2017
88. Fast painting animation
- Author
-
Gozde Bozdagi Akar and Ece Selin Boncu
- Subjects
Measure (data warehouse) ,Painting ,business.industry ,Computer science ,010102 general mathematics ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Inpainting ,020206 networking & telecommunications ,Image processing ,02 engineering and technology ,Animation ,01 natural sciences ,Image (mathematics) ,Computer graphics (images) ,0202 electrical engineering, electronic engineering, information engineering ,Computer vision ,Artificial intelligence ,0101 mathematics ,business - Abstract
In this paper, an application of short video synthesis from single frame images is realized and a comparative analysis of different methods on image inpainting, which is a computationally costly part of the whole procedure, is provided. Our work is fortified with experiments in order to measure the computational performances and efficiencies of the proposed method and the ones existing in literature.
- Published
- 2017
89. Forward looking infrared imagery for landmine detection
- Author
-
Aylin Bayram and Gozde Bozdagi Akar
- Subjects
Thermal radiation ,Computer science ,020208 electrical & electronic engineering ,010401 analytical chemistry ,0202 electrical engineering, electronic engineering, information engineering ,Anomaly detection ,02 engineering and technology ,Forward looking infrared ,Radiation ,01 natural sciences ,0104 chemical sciences ,Remote sensing ,Constant false alarm rate - Abstract
Infrared imagery is widely used in many applications in both civilian and military areas. In landmine detection, the goal is to detect the anomalies between mine surface and soil from variation of reflected/emitted thermal radiation. In this study, various types of anomaly detection techniques of IR are investigated and the feasibility of these techniques for use in landmine detection is analyzed. Additionally, effects of parameters for algorithms are compared and the parameters are optimized for increasing detection accuracy. Furthermore, fusion of the algorithms is performed to reduce False Alarm Rate (FAR). We also prepare an experimental setup to reflect the effects of environmental changes on FLIR imagery recording. Soil and various types of landmine mock-ups are also examined in this setup. Finally, all anomalies are mapped into local coordinate space for indicating possible landmines locations.
- Published
- 2017
90. Texture and edge preserving multiframe super‐resolution
- Author
-
Emre Turgay and Gozde Bozdagi Akar
- Subjects
business.industry ,Texture (cosmology) ,Estimator ,Pattern recognition ,Iterative reconstruction ,Edge (geometry) ,Image (mathematics) ,Image texture ,Signal Processing ,Discrete cosine transform ,Computer vision ,Computer Vision and Pattern Recognition ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,Image resolution ,Software - Abstract
Super-resolution (SR) image reconstruction refers to methods where a higher resolution image is reconstructed using a set of overlapping aliased low-resolution observations of the same scene. Although edge preservation has been a widely explored topic in SR literature, texture-specific regularisation has recently gained interest. In this study, texture-specific regularisation is handled as a post-processing step. A two stage method is proposed, comprising multiple SR reconstructions with different regularisation parameters followed by a restoration step for preserving edges and textures. In the first stage, two maximum-a-posteriori estimators with two different amounts of regularisation are employed. In the second stage, pixel-to-pixel difference between these two estimates is post-processed to restore edges and textures. Frequency selective characteristics of discrete cosine transform and Gabor filters are utilised in the post-processing step. Experiments on synthetically generated images and real experiments demonstrate that the proposed methods give better results compared with the state-of-the-art SR methods especially on textures and edges.
- Published
- 2014
91. Combination of physics-based and image-based features for landmine identification in ground penetrating radar data
- Author
-
Alper Genc and Gozde Bozdagi Akar
- Subjects
010504 meteorology & atmospheric sciences ,business.industry ,Perspective (graphical) ,Feature extraction ,0211 other engineering and technologies ,Pattern recognition ,02 engineering and technology ,01 natural sciences ,Parameter identification problem ,Identification (information) ,Feature (computer vision) ,Ground-penetrating radar ,General Earth and Planetary Sciences ,Clutter ,False alarm ,Artificial intelligence ,business ,021101 geological & geomatics engineering ,0105 earth and related environmental sciences - Abstract
Ground penetrating radar (GPR) is a powerful technology for detection and identification of buried explosives, especially with little or no metal content. However, subsurface clutter and soil distortions increase false alarm rates of current GPR-based landmine detection and identification methods. Most existing algorithms use shape-based, image-based, and physics-based techniques. Analysis of these techniques indicates that each type of algorithm has a different perspective to solve the landmine detection and identification problem. Therefore, one type of method has stronger and weaker points with respect to the other types of algorithms. To reduce false alarm rates of the current GPR-based landmine detection and identification methods, we propose a combined feature utilizing both physics-based and image-based techniques. Combined features are classified with a support vector machine classifier. The proposed algorithm is tested on a simulated data set that contained more than 500 innocuous object signatures and 400 landmine signatures, over half of which are completely nonmetal. The results presented indicate that the proposed method has significant performance benefits for landmine detection and identification in GPR data.
- Published
- 2019
92. Super-resolution Reconstruction of hyperspectral images via an improved MAP-based approach
- Author
-
Seniha Esen Yuksel, Hasan Irmak, Hakan Aytaylan, and Gozde Bozdagi Akar
- Subjects
Random field ,business.industry ,0211 other engineering and technologies ,Hyperspectral imaging ,Pattern recognition ,02 engineering and technology ,Energy minimization ,Least squares ,Computer Science::Computer Vision and Pattern Recognition ,Principal component analysis ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,Minification ,Spectral resolution ,business ,Image resolution ,021101 geological & geomatics engineering ,Mathematics - Abstract
Super-resolution Reconstruction (SRR) is technique to increase the spatial resolution of images. It is especially useful for hyperspectral images (HSI), which have good spectral resolution but low spatial resolution. In this study, we propose an improvement to our previous work and present a novel MAP-MRF (maximum a posteriori-Markov random Fields) based approach for the SRR of HSI. The key point of our approach is to find the abundance maps of an HSI and perform SRR on the abundance maps using MRF based energy minimization, without needing any other additional source of information. In order to do so, first, PCA is used to determine the endmembers. Second, SISAL and fully constraint least squares (FCLS) are used to estimate the abundance maps. Third, in order to find the high resolution abundance maps, the ill-posed inverse SRR problem for abundances is regularized with a MAP-MRF based approach. The MAP-MRF formulation is restricted with the constraints which are specific to the abundances. Using the non-linear programming (NLP) techniques, the convex MAP formulation is minimized and High Resolution (HR) abundance maps are obtained. Then, these maps are used to construct the HR HSI. This improved SRR method is verified on real data sets, and quantitative performance comparison is achieved using PSNR, SSIM and PSNR metrics. Our results indicate that this improved method gives very close results to the original high resolution images, keeps the spectral consistency, and performs better than the compared algorithms.
- Published
- 2016
93. Fusion of KLMS and blob based pre-screener for buried landmine detection using ground penetrating radar
- Author
-
Serhat Ozturk, Gozde Bozdagi Akar, Seniha Esen Yuksel, and Bora Baydar
- Subjects
Fusion ,business.industry ,Computer science ,0211 other engineering and technologies ,020206 networking & telecommunications ,Pattern recognition ,02 engineering and technology ,Kernel (image processing) ,Ground-penetrating radar ,0202 electrical engineering, electronic engineering, information engineering ,False alarm ,Artificial intelligence ,business ,021101 geological & geomatics engineering ,Remote sensing - Abstract
In this paper, a decision level fusion using multiple pre-screener algorithms is proposed for the detection of buried landmines from Ground Penetrating Radar (GPR) data. The Kernel Least Mean Square (KLMS) and the Blob Filter pre-screeners are fused together to work in real time with less false alarms and higher true detection rates. The effect of the kernel variance is investigated for the KLMS algorithm. Also, the results of the KLMS and KLMS+Blob filter algorithms are compared to the LMS method in terms of processing time and false alarm rates. Proposed algorithm is tested on both simulated data and real data collected at the field of IPA Defence at METU, Ankara, Turkey.
- Published
- 2016
94. Real-time panoramic background subtraction on GPU
- Author
-
Serdar Buyuksarac, Gozde Bozdagi Akar, and Alptekin Temizel
- Subjects
Background subtraction ,Panorama ,business.industry ,Computer science ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Graphics processing unit ,Image registration ,02 engineering and technology ,01 natural sciences ,010309 optics ,Real-time computer graphics ,Robustness (computer science) ,Computer graphics (images) ,0103 physical sciences ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Computer vision ,Artificial intelligence ,General-purpose computing on graphics processing units ,business ,ComputingMethodologies_COMPUTERGRAPHICS - Abstract
In this study, we propose a method for panoramic background subtraction by using Pan-Tilt cameras in real-time. The proposed method is based on parallelization of image registration, panorama generation and background subtraction operations to run on Graphics Processing Unit (GPU). Experiments results showed that GPU usage increases speed of the algorithm 33 times without considerable performance loss and makes working real-time possible.
- Published
- 2016
95. Hyperspectral imagery superresolution
- Author
-
Seniha Esen Yuksel, Hasan Irmak, and Gozde Bozdagi Akar
- Subjects
business.industry ,Low resolution ,Resolution (electron density) ,0211 other engineering and technologies ,Hyperspectral imaging ,02 engineering and technology ,Superresolution ,Statistics::Machine Learning ,Abundance (ecology) ,Computer Science::Computer Vision and Pattern Recognition ,Full spectral imaging ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Computer vision ,Artificial intelligence ,Spectral resolution ,business ,Image resolution ,Geology ,021101 geological & geomatics engineering ,Remote sensing - Abstract
Despite their high spectral resolution, hyperspectral images have low spatial resolution which adversely affects the applications that use hyperspectral images. In this study, instead of the traditional way of using spectral images, abundances of the endmembers are used in resolution enhancement. In the proposed method, first, endmembers are extracted with the SISAL algorithm. Then, the abundance maps are estimated using FCLS. From the low resolution abundance maps, high resolution abundance maps are obtained with a total variation based minimization. Finally, high resolution hyperspectral images are constructed from high resolution abundance maps. The proposed method is tested on real hyperspectral images. The experimental results and comparative analysis show the effectiveness of the proposed method.
- Published
- 2016
96. A multimodal approach for aggressive driving detection
- Author
-
Gozde Bozdagi Akar, Enes Yuncu, and Omurcan Kumtepe
- Subjects
050210 logistics & transportation ,Computer science ,Feature vector ,05 social sciences ,Real-time computing ,02 engineering and technology ,Kalman filter ,Visualization ,Aggressive driving ,0502 economics and business ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Session (computer science) ,Simulation - Abstract
Aggressive driving behavior is among the important causes of traffic accidents. Hence, detection of driver aggressiveness has an importance in terms of decreasing the number of traffic accidents. Collected driving data while the vehicle is in traffic can be used to make inferences about the aggressiveness of the driver. In this study, a multimodal method is proposed in order to detect driver aggressiveness. The proposed method is based on utilizing the visual data taken from the on vehicle camera and sensor data taken from the controller area network bus (CAN-bus) in order to decide whether the driving session involves aggressive driving behavior. Lane following pattern and vehicle following distance information is obtained from the data collected by camera while vehicle speed and engine speed information is obtained from CAN-bus. These information is fused to conceive feature vectors that represent the driving session and aggressiveness decision is made according to the classification of these feature vectors.
- Published
- 2016
97. Scene Nudity Level Detection With Deep Nets
- Author
-
Gozde Bozdagi Akar, Ersin Esen, Savas Ozkan, and Ilkay Atil
- Subjects
Computer science ,business.industry ,Generalization ,020207 software engineering ,Pattern recognition ,02 engineering and technology ,010501 environmental sciences ,Machine learning ,computer.software_genre ,01 natural sciences ,Constant false alarm rate ,Set (abstract data type) ,Support vector machine ,0202 electrical engineering, electronic engineering, information engineering ,False alarm ,Artificial intelligence ,business ,computer ,Dropout (neural networks) ,0105 earth and related environmental sciences - Abstract
In this paper, we present an approach that can detect scene nudity level with high precision using different deep net configurations. For this purpose, a recent approach [1] which has intense and very deep convolution layers is used. During net modelling, we strive to obtain most successful net configuration by comparing different Dropout models and image sizes -64 × 64, 128 × 128-. Additionally, leveraging the generalization capability of Support Vector Machine (SVM), improvement on success rate is demonstrated by retraining the features obtained at different output levels of the nets with SVM. At test and training stages, scene is investigated under three nudity levels: regular, semi-nudity and full-nudity. In order to evaluate false alarm rates of the net models, tests are conducted on different datasets which are SUN2012, LEAR Human and a dataset contains only semi-nudity samples besides validation set determined for each class. The results indicate that high precision rates can be achieved with low false alarm rate exploiting deep net models.
- Published
- 2016
98. Improved prediction methods for scalable predictive animated mesh compression
- Author
-
M. Oguz Bici and Gozde Bozdagi Akar
- Subjects
Theoretical computer science ,Computer science ,020207 software engineering ,Data_CODINGANDINFORMATIONTHEORY ,02 engineering and technology ,ENCODE ,Compression (functional analysis) ,Signal Processing ,Scalability ,0202 electrical engineering, electronic engineering, information engineering ,Media Technology ,020201 artificial intelligence & image processing ,Polygon mesh ,Computer Vision and Pattern Recognition ,Static mesh ,Electrical and Electronic Engineering ,Quantization (image processing) ,Encoder ,Algorithm ,Decoding methods - Abstract
Animated meshes represented as sequences of static meshes sharing the same connectivity require efficient compression. Among the compression techniques, layered predictive coding methods efficiently encode the animated meshes in a structured way such that the successive reconstruction with an adaptable quality can be performed. The decoding quality heavily depends on how well the prediction is performed in the encoder. Due to this fact, in this paper, three novel prediction structures are proposed and integrated into a state of the art layered predictive coder. The proposed structures are based on weighted spatial prediction with its weighted refinement and angular relations of triangles between current and previous frames. The experimental results show that compared to the state of the art scalable predictive coder, up to 30% bitrate reductions can be achieved with the combination of proposed prediction schemes depending on the content and quantization level.
- Published
- 2011
99. Multiple description coding of animated meshes
- Author
-
Gozde Bozdagi Akar and M. Oguz Bici
- Subjects
Multiple description coding ,020207 software engineering ,02 engineering and technology ,Rate–distortion theory ,Signal Processing ,Scalability ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Polygon mesh ,Computer Vision and Pattern Recognition ,Static mesh ,Electrical and Electronic Engineering ,Error detection and correction ,Algorithm ,Software ,Decoding methods ,Mathematics ,Coding (social sciences) - Abstract
In this paper, we propose three novel multiple description coding (MDC) methods for reliable transmission of compressed animated meshes represented by series of 3D static meshes with same connectivity. The proposed methods trade off reconstruction quality for error resilience to provide the best expected reconstruction of 3D mesh sequence at the decoder side. The methods are based on layer duplication and partitioning of the set of vertices of a scalable coded animated mesh by either spatial or temporal subsampling. Each partitioned set is encoded separately to generate independently decodable bitstreams or so-called descriptions. In addition, the proposed MDC methods can achieve varying redundancy allocations by including a number of encoded spatial or temporal layers from the other description. The algorithms are evaluated with redundancy-rate-distortion (RRD) curves and per-frame reconstruction analysis. RRD performances show that vertex partitioning-based MDC performs better at low redundancies for especially spatially dense models. Temporal subsampling-based MDC performs better at moderate redundancies as well as low redundancies for spatially coarse models. Layer duplication-based MDC can achieve the lowest redundancies with flexible redundancy allocation capability and can be designed to achieve the smallest variance of reconstruction quality between consecutive frames.
- Published
- 2010
100. Automatic detection of learning styles for an e-learning system
- Author
-
Gozde Bozdagi Akar and Ebru Özpolat
- Subjects
General Computer Science ,business.industry ,Instructional design ,Computer science ,Information technology ,Machine learning ,computer.software_genre ,Automation ,Education ,Learning styles ,Personality factors ,Computer software ,ComputingMilieux_COMPUTERSANDEDUCATION ,Artificial intelligence ,business ,computer ,Classifier (UML) ,Cognitive style - Abstract
A desirable characteristic for an e-learning system is to provide the learner the most appropriate information based on his requirements and preferences. This can be achieved by capturing and utilizing the learner model. Learner models can be extracted based on personality factors like learning styles, behavioral factors like user's browsing history and knowledge factors like user's prior knowledge. In this paper, we address the problem of extracting the learner model based on Felder-Silverman learning style model. The target learners in this problem are the ones studying basic science. Using NBTree classification algorithm in conjunction with Binary Relevance classifier, the learners are classified based on their interests. Then, learners' learning styles are detected using these classification results. Experimental results are also conducted to evaluate the performance of the proposed automated learner modeling approach. The results show that the match ratio between the obtained learner's learning style using the proposed learner model and those obtained by the questionnaires traditionally used for learning style assessment is consistent for most of the dimensions of Felder-Silverman learning style.
- Published
- 2009
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.