Descriptor: "Receptive field" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Receptive field"' showing total 15,078 results

Start Over Descriptor "Receptive field"

15,078 results on '"Receptive field"'

1. A data augmentation method for computer vision task with feature conversion between class

Author: Lin, Jiewen, Hu, Gui, and Chen, Jian
Published: 2025
Full Text: View/download PDF

2. GABAergic amacrine cells balance biased chromatic information in the mouse retina

Author: Korympidou, Maria M., Strauss, Sarah, Schubert, Timm, Franke, Katrin, Berens, Philipp, Euler, Thomas, and Vlasits, Anna L.
Published: 2024
Full Text: View/download PDF

3. Heterogeneous orientation tuning in the primary visual cortex of mice diverges from Gabor-like receptive fields in primates

Author: Fu, Jiakun, Pierzchlewicz, Paweł A., Willeke, Konstantin F., Bashiri, Mohammad, Muhammad, Taliah, Diamantaki, Maria, Froudarakis, Emmanouil, Restivo, Kelli, Ponder, Kayla, Denfield, George H., Sinz, Fabian, Tolias, Andreas S., and Franke, Katrin
Published: 2024
Full Text: View/download PDF

4. Dual selective fusion transformer network for hyperspectral image classification

Author: Xu, Yichu, Wang, Di, Zhang, Lefei, and Zhang, Liangpei
Published: 2025
Full Text: View/download PDF

5. Enhanced human contrast sensitivity with increased stimulation of melanopsin in intrinsically photosensitive retinal ganglion cells

Author: Chien, Sung-En, Yeh, Su-Ling, Yamashita, Wakayo, and Tsujimura, Sei-ichi
Published: 2023
Full Text: View/download PDF

6. Window-Based Channel Attention for Wavelet-Enhanced Learned Image Compression

Author: Xu, Heng, Hai, Bowen, Tang, Yushun, He, Zhihai, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Cho, Minsu, editor, Laptev, Ivan, editor, Tran, Du, editor, Yao, Angela, editor, and Zha, Hongbin, editor
Published: 2025
Full Text: View/download PDF

7. MMR-Sleep: A Multi-Channel and Multi-Receptive Field Sleep Stage Recognition Model

Author: Zheng, Deqin, Zhu, Haiqi, Gao, Ruichen, Song, Chenyue, Zhang, Wei, Jiang, Feng, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Lin, Zhouchen, editor, Cheng, Ming-Ming, editor, He, Ran, editor, Ubul, Kurban, editor, Silamu, Wushouer, editor, Zha, Hongbin, editor, Zhou, Jie, editor, and Liu, Cheng-Lin, editor
Published: 2025
Full Text: View/download PDF

8. Normative theory of visual receptive fields

Author: Lindeberg, Tony
Published: 2021
Full Text: View/download PDF

9. Orientation selectivity properties for the affine Gaussian derivative and the affine Gabor models for visual receptive fields.

Author: Lindeberg, Tony
Abstract: This paper presents an in-depth theoretical analysis of the orientation selectivity properties of simple cells and complex cells, that can be well modelled by the generalized Gaussian derivative model for visual receptive fields, with the purely spatial component of the receptive fields determined by oriented affine Gaussian derivatives for different orders of spatial differentiation. A detailed mathematical analysis is presented for the three different cases of either: (i) purely spatial receptive fields, (ii) space-time separable spatio-temporal receptive fields and (iii) velocity-adapted spatio-temporal receptive fields. Closed-form theoretical expressions for the orientation selectivity curves for idealized models of simple and complex cells are derived for all these main cases, and it is shown that the orientation selectivity of the receptive fields becomes more narrow, as a scale parameter ratio κ , defined as the ratio between the scale parameters in the directions perpendicular to vs. parallel with the preferred orientation of the receptive field, increases. It is also shown that the orientation selectivity becomes more narrow with increasing order of spatial differentiation in the underlying affine Gaussian derivative operators over the spatial domain. A corresponding theoretical orientation selectivity analysis is also presented for purely spatial receptive fields according to an affine Gabor model, showing that: (i) the orientation selectivity becomes more narrow when making the receptive fields wider in the direction perpendicular to the preferred orientation of the receptive field; while (ii) an additional degree of freedom in the affine Gabor model does, however, also strongly affect the orientation selectivity properties. [ABSTRACT FROM AUTHOR]
Published: 2025
Full Text: View/download PDF

10. Chilean brush-tailed mouse (Octodon degus): a diurnal precocial rodent as a new model to study visual receptive field properties of superior colliculus neurons.

Author: Márquez, Natalia I., Deichler, Alfonso, Fernández‐Aburto, Pedro, Perales, Ignacio, Letelier, Juan-Carlos, Marín, Gonzalo J., Mpodozis, Jorge, and Pallas, Sarah L.
Subjects: *VISUAL fields, *CONTRAST sensitivity (Vision), *VISUAL acuity, *RODENTS, *PRIMATES, *SUPERIOR colliculus
Abstract: Lab rodent species commonly used to study the visual system and its development (hamsters, rats, and mice) are crepuscular/nocturnal, altricial, and possess simpler visual systems than carnivores and primates. To widen the spectra of studied species, here we introduce an alternative model, the Chilean degu (Octodon degus). This diurnal, precocial Caviomorph rodent has a cone-enriched, well-structured retina, and well-developed central visual projections. To assess degus' visual physiological properties, we characterized the visual responses and receptive field (RF) properties of isolated neurons in the superficial layers of the superior colliculus (sSC). To facilitate comparison with studies in other rodent species, we used four types of stimuli: 1) a moving white square, 2) sinusoidal gratings, 3) an expanding black circle (looming), and 4) a stationary black circle. We found that as in other mammalian species, RF size increases from superficial to deeper SC layers. Compared with other lab rodents, degus sSC neurons had smaller RF sizes and displayed a broader range of spatial frequency (SF) tunings, including neurons tuned to high SF (up to 0.24 cycles/deg). Also, unlike other rodents, approximately half of sSC neurons exhibited linear responses to contrast. In addition, sSC units showed transient ON-OFF responses to stationary stimuli but increased their firing rates as a looming object increased in size. Our results suggest that degus have higher visual acuity, higher SF tuning, and lower contrast sensitivity than commonly used nocturnal lab rodents, positioning degus as a well-suited species for studies of diurnal vision that are more relevant to humans. NEW & NOTEWORTHY: Rodent species commonly used to study vision are crepuscular/nocturnal, altricial, and possess simpler visual systems than diurnal mammals. Here we introduce an alternative model, the diurnal, precocial, Octodon degus, a Caviomorph rodent with a well-developed visual system. In this study, we characterize the visual responses of the degus' superior colliculus. Our results suggest that degus have higher visual acuity than nocturnal rodents, positioning degus as a well-suited species for studies of human-like diurnal vision. [ABSTRACT FROM AUTHOR]
Published: 2025
Full Text: View/download PDF

11. A hybrid attention multi-scale fusion network for real-time semantic segmentation.

Author: Ye, Baofeng, Xue, Renzheng, and Wu, Qianlong
Subjects: *COGNITIVE psychology, *DATA mining, *ATTENTION, *SPEED, *ALGORITHMS
Abstract: In semantic segmentation research, spatial information and receptive fields are essential. However, currently, most algorithms focus on acquiring semantic information and lose a significant amount of spatial information, leading to a significant decrease in accuracy despite improving real-time inference speed. This paper proposes a new method to address this issue. Specifically, we have designed a new module (HFRM) that combines channel attention and spatial attention to retrieve the spatial information lost during downsampling and enhance object classification accuracy. Regarding fusing spatial and semantic information, we have designed a new module (HFFM) to merge features of two different levels more effectively and capture a larger receptive field through an attention mechanism. Additionally, edge detection methods have been incorporated to enhance the extraction of boundary information. Experimental results demonstrate that for an input size of 512 × 1024, our proposed method achieves 73.6% mIoU at 176 frames per second (FPS) on the Cityscapes dataset and 70.0% mIoU at 146 FPS on Camvid. Compared to existing networks, our Model achieves faster inference speed while maintaining accuracy, enhancing its practicality. [ABSTRACT FROM AUTHOR]
Published: 2025
Full Text: View/download PDF

12. A multiscale feature fusion‐guided lightweight semantic segmentation network.

Author: Ye, Xin, Pan, Junchen, Chen, Jichen, and Zhang, Jingbo
Subjects: CASCADE connections, AUTONOMOUS vehicles
Abstract: Semantic segmentation, a task of assigning class labels to each pixel in an image, has found applications in various real‐world scenarios, including autonomous driving and scene understanding. However, its widespread use is hindered by the high computational burden. In this paper, we propose an efficient semantic segmentation method based on Feature Cascade Fusion Network (FCFNet) to address this challenge. FCFNet utilizes a dual‐path framework comprising the Spatial Information Path (SIP) and the Context Information Path (CIP). SIP is a shallow structure that captures the local dependencies of each pixel to improve the accuracy of detailed segmentation. CIP is the main branch with a deeper structure that captures sufficient contextual information from input features. Moreover, we design an Efficient Receptive Field Module (ERFM) to enlarge the receptive field in the SIP. Meanwhile, Attention Shuffled Refinement Module is used to refine feature maps from different stages. Finally, we present an Attention‐Guided Fusion Module to fuse the low‐ and high‐level feature maps effectively. Experimental results show that our proposed FCFNet achieves 70.7% mean intersection over union (mIoU) on the Cityscapes data set and 68.1% mIoU on the CamVid data set, respectively, with inference speeds of 110 and 100 frames per second (FPS), respectively. Additionally, we evaluated FCFNet on the Nvidia Jetson Xavier embedded device, which demonstrated competitive performance while significantly reducing power consumption. [ABSTRACT FROM AUTHOR]
Published: 2025
Full Text: View/download PDF

13. Learning delays through gradients and structure: emergence of spatiotemporal patterns in spiking neural networks.

Author: Mészáros, Balázs, Knight, James C., and Nowotny, Thomas
Subjects: ARTIFICIAL neural networks, ELECTRONIC data processing
Abstract: We present a Spiking Neural Network (SNN) model that incorporates learnable synaptic delays through two approaches: per-synapse delay learning via Dilated Convolutions with Learnable Spacings (DCLS) and a dynamic pruning strategy that also serves as a form of delay learning. In the latter approach, the network dynamically selects and prunes connections, optimizing the delays in sparse connectivity settings. We evaluate both approaches on the Raw Heidelberg Digits keyword spotting benchmark using Backpropagation Through Time with surrogate gradients. Our analysis of the spatio-temporal structure of synaptic interactions reveals that, after training, excitation and inhibition group together in space and time. Notably, the dynamic pruning approach, which employs DEEP R for connection removal and RigL for reconnection, not only preserves these spatio-temporal patterns but outperforms per-synapse delay learning in sparse networks. Our results demonstrate the potential of combining delay learning with dynamic pruning to develop efficient SNN models for temporal data processing. Moreover, the preservation of spatio-temporal dynamics throughout pruning and rewiring highlights the robustness of these features, providing a solid foundation for future neuromorphic computing applications. [ABSTRACT FROM AUTHOR]
Published: 2025
Full Text: View/download PDF

14. Wavelet-Transform-Based Neural Network for Tidal Flat Remote Sensing Image Deblurring

Author: Denghao Yang, Zhiyu Zhu, Huilin Ge, Cheng Xu, and Jing Zhang
Subjects: Combined loss function, dilated convolution, image deblurring, receptive field, tidal flat scenarios, wavelet transform, Ocean engineering, TC1501-1800, Geophysics. Cosmic physics, QC801-809
Abstract: In response to the challenge of image degradation caused by strong sea breezes during drone surveillance over tidal flats, conventional methodologies have predominantly employed iterative upsampling and downsampling techniques to augment the receptive field of the network. However, this approach is prone to the loss of critical texture data within the tidal flat imagery throughout the sampling process. To mitigate these issues and enhance the recovery of sharp imagery from blurred inputs, we introduce a novel deep learning architecture based on traditional physical models. Our network structure mainly consists of two parts: Initially, applying wavelet transform to the input images, the extracted high-frequency components are refined using a combination of Bayesian adaptive thresholding and hard thresholding. This process not only ensures high fidelity of the high-frequency information but also contributes to generating tidal flat images with clearer texture details. Subsequently, given the relatively large scope of images captured by drones, there is an increased emphasis on the importance of contextual information during the feature extraction process. To this end, we have applied dilated convolution modules with varying dilation rates to the low-frequency components. This design enables the network to capture image features at different scales, enhancing the model's understanding of the tidal flat scene context and improving its feature expression capability. This allows the model to more accurately identify and extract blurred areas, thereby improving the deblurring effect. Additionally, we have incorporated a loss function based on the wavelet transform. This function guides the model to recover clear details from blurred images by minimizing the differences between the high-frequency and low-frequency components of the original clear image and those of the deblurred image. In the quantitative assessment of the real tidal flat image dataset, we observed that the algorithm has a parameter volume of 7.8M and has achieved significant performance improvement: the peak signal-to-noise ratio (PSNR) reached 33.11, and the structural similarity index reached 0.7909. The enhancement of these metrics indicates that the algorithm excels in the recovery of image texture details while maintaining a compact parameter count. The optimized parameter configuration not only improves the algorithm's operational efficiency but also simplifies the deployment and training process of the model, making it more suitable for tidal flat scenarios.
Published: 2025
Full Text: View/download PDF

15. GLF-NET: Global and Local Dynamic Feature Fusion Network for Real-Time Steel Strip Surface Defect Detection

Author: Yunfei Ma, Zhaohui Zhang, Shaocheng Ma, Kailun Shi, and Chenglong Fan
Subjects: Surface defect detection, YOLOv5s, feature fusion, receptive field, attention mechanism, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
Abstract: Surface defect detection plays a crucial role in ensuring the quality standards of hot-rolled steel strips. To meet the demands for high precision and real-time performance in industrial defect detection, this paper introduces an improved one-stage detector based on YOLOv5s, named GLF-NET, that focuses on a good balance between speed and precision. Firstly, an Attention Augmented Module (AAM) is proposed and used in the backbone, with the aim of minimizing the loss of semantic and location information of defects during the process of feature extraction. Secondly, to enrich the model’s capacity of representing multi-scale features of defects, an innovative Global and Local Dynamic Feature Fusion (GLF) module is designed and plugged into the top-down FPN part of the neck, bridging the semantic gap between different feature layers and enabling the model to adaptively select features for fusion. Additionally, a novel Receptive Field Augmented Module (RFA) is proposed and integrated into the bottom-up PAN structure of the neck, enhancing the detector’s ability of perceiving defects with irregular shapes and large aspect ratios. Extensive experimental results on the NEU-DET steel strip surface defect dataset demonstrate that GLF-NET obtains an impressive mAP value of 79.2%, exceeding YOLOv5s by 4.2%. Furthermore, with an impressive detection speed of 95 Frames Per Second (FPS), GLF-NET not only meets the real-time demands of industrial defect detection but also demonstrates exceptional capabilities in defect detection. Code is available at https://github.com/MYF1124/GLF-NET.
Published: 2025
Full Text: View/download PDF

16. Alpha-2 nicotinic acetylcholine receptors regulate spectral integration in auditory cortex

Author: Intskirveli, Irakli, Gil, Susan, Lazar, Ronit, and Metherate, Raju
Subjects: Biomedical and Clinical Sciences, Neurosciences, 1.1 Normal biological development and functioning, Neurological, Animals, Auditory Cortex, Receptors, Nicotinic, Mice, Mice, Transgenic, Male, Mice, Inbred C57BL, Nicotine, Female, Acoustic Stimulation, Mice, Knockout, Interneurons, Nicotinic Agonists, nicotine, mouse, receptive field, electrophysiology, current-source density, neuromodulation, martinotti, Biological psychology
Abstract: IntroductionIn primary auditory cortex (A1), nicotinic acetylcholine receptors (nAChRs) containing α2 subunits are expressed in layer 5 Martinotti cells (MCs)-inhibitory interneurons that send a main axon to superficial layers to inhibit distal apical dendrites of pyramidal cells (PCs). MCs also contact interneurons in supragranular layers that, in turn, inhibit PCs. Thus, MCs may regulate PCs via inhibition and disinhibition, respectively, of distal and proximal apical dendrites. Auditory inputs to PCs include thalamocortical inputs to middle layers relaying information about characteristic frequency (CF) and near-CF stimuli, and intracortical long-distance ("horizontal") projections to multiple layers carrying information about spectrally distant ("nonCF") stimuli. CF and nonCF inputs integrate to create broad frequency receptive fields (RFs). Systemic administration of nicotine activates nAChRs to "sharpen" RFs-to increase gain within a narrowed RF-resulting in enhanced responses to CF stimuli and reduced responses to nonCF stimuli. While nicotinic mechanisms to increase gain have been identified, the mechanism underlying RF narrowing is unknown.MethodsHere, we examine the role of α2 nAChRs in mice with α2 nAChR-expressing neurons labeled fluorescently, and in mice with α2 nAChRs genetically deleted.ResultsThe distribution of fluorescent neurons in auditory cortex was consistent with previous studies demonstrating α2 nAChRs in layer 5 MCs, including nonpyramidal somata in layer 5 and dense processes in layer 1. We also observed label in subcortical auditory regions, including processes, but no somata, in the medial geniculate body, and both fibers and somata in the inferior colliculus. Using electrophysiological (current-source density) recordings in α2 nAChR knock-out mice, we found that systemic nicotine failed to enhance CF-evoked inputs to layer 4, suggesting a role for subcortical α2 nAChRs, and failed to reduce nonCF-evoked responses, suggesting that α2 nAChRs regulate horizontal projections to produce RF narrowing.DiscussionThe results support the hypothesis that α2 nAChRs function to simultaneously enhance RF gain and narrow RF breadth in A1. Notably, a similar neural circuit may recur throughout cortex and hippocampus, suggesting widespread conserved functions regulated by α2 nAChRs.
Published: 2024

17. LGCGNet: A local-global context guided network for real-time water surface semantic segmentation: LGCGNet: A local-global context guided network for real-time water...: T. Liu et al.

Author: Liu, Ting, Luo, Peiqi, Wang, Guofeng, Zhang, Yuxin, Lu, Xiangyi, and Dong, Mengyu
Abstract: Unmanned boats will encounter many static and dynamic obstacles during navigation, and only real-time obstacle sensing can ensure safe navigation and long endurance of unmanned boats. In this paper, LGCGNet is proposed to perform real-time water surface semantic segmentation on the images captured by the on-board camera. In order to ensure that the model adapted to obstacles with extremely variable scales, a local-global module is proposed in this paper. The local-global module consisted of residual dense dilated module and context-enhanced separable self-attention. Residual dense dilated module enabled the enhancement of local detail information and context-enhanced separable self-attention enabled model receptive field expansion. In addition, the sub-pixel downsampling module is used to avoid the loss of feature information to improve segmentation accuracy. Experiments on the MaSTr1325 dataset showed that LGCGNet apprpached the segmentation accuracy of state-of-the-art semantic segmentation models with only 689,000 parameters and 9.068G floating point operations per second, with an mIoU of 84.14%. In addition, the processing speed of LGCGNet is 34.86FPS, which meets the frame rate conditions of commercially available photovoltaic equipment. The experiments demonstrated that the LGCGNet proposed in this paper strike a good balance between achieving high accuracy, reducing model size and improving real-time performance. [ABSTRACT FROM AUTHOR]
Published: 2025
Full Text: View/download PDF

18. Lightweight remote sensing image super-resolution with sequential multi-scale feature enhancement network.

Author: Qi, Ailing and Qi, Shuai
Subjects: *REMOTE sensing, *FEATURE extraction, *HIGH resolution imaging, *SUPPLY & demand
Abstract: Remote sensing images possess abundant texture features and significant autocorrelation. However, the extensive network parameters and high computational demands of current super-resolution (SR) methods make them challenging to implement on mobile devices. This work proposes a lightweight model named the Sequential Multi-Scale Feature Enhancement Network (SMFEN) that address the issue on single remote sensing image super-resolution. Our sequential structure allows for a larger receptive field (RF) with minimal parameters, which can gradually build complex high-level multi-scale feature representations from simple low-level features, realizing the feature extraction process from concrete to abstract. In addition, we design a high-frequency multi-scale attention block which use multi-scale high-frequency details to facilitate the fusion of contextual information across different scales and effectively recover texture and edge information. Comprehensive experimental results demonstrate that our SMFEN network outperforms the latest lightweight SR methods. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

19. Feature selectivity and invariance in marsupial primary visual cortex.

Author: Jung, Young Jun, Almasi, Ali, Sun, Shi, Yunzab, Molis, Baquier, Sebastien H., Renfree, Marilyn, Meffin, Hamish, and Ibbotson, Michael R.
Subjects: *VISUAL cortex, *VISUAL fields, *SPATIAL filters, *VISUAL perception, *RANDOM noise theory
Abstract: Key points A fundamental question in sensory neuroscience revolves around how neurons represent complex visual stimuli. In mammalian primary visual cortex (V1), neurons decode intricate visual features to identify objects, with most being selective for edge orientation, but with half of those also developing invariance to edge position within their receptive fields. Position invariance allows cells to continue to code an edge even when it moves around. Combining feature selectivity and invariance is integral to successful object recognition. Considering the marsupial–eutherian divergence 160 million years ago, we explored whether feature selectivity and invariance was similar in marsupials and eutherians. We recovered the spatial filters and non‐linear processing characteristics of the receptive fields of neurons in wallaby V1 and compared them with previous results from cat cortex. We stimulated the neurons in V1 with white Gaussian noise and analysed responses using the non‐linear input model. Wallabies exhibit the same high percentage of orientation selective neurons as cats. However, in wallabies we observed a notably higher prevalence of neurons with three or more filters compared to cats. We show that having three or more filters substantially increases phase invariance in the V1s of both species, but that wallaby V1 accentuates this feature, suggesting that the species condenses more processing into the earliest cortical stage. These findings suggest that evolution has led to more than one solution to the problem of creating complex visual processing strategies. Previous studies have shown that the primary visual cortex (V1) in mammals is essential for processing complex visual stimuli, with neurons displaying selectivity for edge orientation and position. This research explores whether the visual processing mechanisms in marsupials, such as wallabies, are similar to those in eutherian mammals (e.g. cats). The study found that wallabies have a higher prevalence of neurons with multiple spatial filters in V1, indicating more complex visual processing. Using a non‐linear input model, we demonstrated that neurons with three or more filters increase phase invariance. These findings suggest that marsupials and eutherian mammals have evolved similar strategies for visual processing, but marsupials have condensed more capacity to build phase invariance into the first step in the cortical pathway. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

20. Feature-adaptive FPN with multiscale context integration for underwater object detection.

Author: Bhalla, Shikha, Kumar, Ashish, and Kushwaha, Riti
Subjects: *OBJECT recognition (Computer vision), *COMPUTER vision, *ATTENUATION of light, *MARINE biology, *GENERALIZATION
Abstract: Underwater object detection is vital for diverse applications, from studies in marine biology to underwater robotics. However, underwater environments pose unique challenges, including reduced visibility due to color distortion, light attenuation, and complex backgrounds. Traditional computer vision methods have limitations, prompting the implementation of deep learning, for underwater object detection. Despite progress, challenges persist, such as visual degradation, scale variations, diverse marine species, and complex backgrounds. To address these issues, we propose Feature-Adaptive FPN with Multiscale Context Integration (FA-FPN-MCI), a novel deep-learning algorithm aimed at enhancing both detection and domain generalization performance. We integrate the Style Normalization and Restitution (SNR) module for domain generalization, Receptive Field Blocks (RFBs) for fine-grained detail capture, and a twin-branch Global Context Module (TBGCM) for multiscale context information. We enhance lateral connections within the Feature Pyramid Network (FPN) with deformable convolution. Experimental outcome reveal that the proposed method attains mean average precision of 84.2%. Additionally, other performance metrics were evaluated, and outperforming all other methods used for comparison. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

21. PillarVTP: vehicle trajectory prediction method based on local point cloud aggregation and receptive field expansion.

Author: Liao, Zhuhua, Yang, Jiyuan, Zhao, Yijiang, Liu, Yizhi, and Zhang, Hui
Abstract: Vehicle trajectory prediction plays a crucial role in the control and safety warning of autonomous vehicles. Existing methods often depend on costly high definition (HD) maps for generating trajectories to fit their scenarios, or involve inefficient aggregation of local point clouds into voxels. Therefore, an end-to-end vehicle trajectory prediction method (PillarVTP) is proposed based on local point cloud aggregation and receptive field expansion. Firstly, we construct a novel pillar-based object detection network, introducing SPPCSPC which uses max pooling layers with multiple kernel sizes on a single feature level as the neck for extracting multi-scale features, and improving ResNet-18 by adding a depth stage to expand the receptive field at multiple levels. Then, we present performing feature upsampling to improve performance before predicting vehicle positions. And a shallow convolutional network is utilized to implement the future feature learning network, which learns future features from the previous features for predicting vehicle positions in future frames. Subsequently, the positions of vehicles are matched greedily from future frames to the current frame, and the matched future trajectories are associated with the vehicles detected in the current frame. Finally, the proposed PillarVTP is evaluated on the nuScenes and Argoverse 1 datasets. Experimental results demonstrate that PillarVTP outperforms recent end-to-end prediction method based on point cloud data, FutureDet, by 3.4% and surpasses traditional multi-stage method, Trajectron + + , by 13.7%. Furthermore, PillarVTP shows good robustness under various weather conditions. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

22. SCENet: Small Kernel Convolution with Effective Receptive Field Network for Brain Tumor Segmentation.

Author: Guo, Bin, Cao, Ning, Zhang, Ruihao, and Yang, Peng
Subjects: BRAIN tumors, DEEP learning, IMAGE segmentation, DIAGNOSTIC imaging, DIAGNOSIS
Abstract: Brain tumors are serious conditions, which can cause great trauma to patients, endangering their health and even leading to disability or death. Therefore, accurate preoperative diagnosis is particularly important. Accurate brain tumor segmentation based on deep learning plays an important role in the preoperative treatment planning process and has achieved good performance. However, one of the challenges involved is an insufficient ability to extract features with a large receptive field in encoder layers and guide the selection of deep semantic information in decoder layers. We propose small kernel convolution with an effective receptive field network (SCENet) based on UNet, which involves a small kernel convolution with effective receptive field shuffle module (SCER) and a channel spatial attention module (CSAM). The SCER module utilizes the inherent properties of stacking convolution to obtain effectively receptive fields and improve the features with a large receptive field extraction ability. CSAM of decoder layers can preserve more detailed features to capture clearer contours of the segmented image by calculating the weights of channels and spaces. An ASPP module is introduced to the bottleneck layer to enlarge the receptive field and can capture multi-scale detailed features. Furthermore, a large number of experiments were performed to evaluate the performance of our model on the BraTS2021 dataset. The SCENet achieved dice coefficient scores of 91.67%, 87.70%, and 83.35% for whole tumor (WT), tumor core (TC), and enhancing tumor (ET), respectively. The results show that the proposed model achieves the state-of-the-art performance compared with more than twelve benchmarks. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

23. A new role for excitation in the retinal direction‐selective circuit.

Author: Ankri, Lea, Riccitelli, Serena, and Rivlin‐Etzion, Michal
Subjects: *RECEPTIVE fields (Neurology), *RETINAL ganglion cells, *VISUAL acuity, *GABA, *ELECTROPHYSIOLOGY
Abstract: A key feature of the receptive field of neurons in the visual system is their centre–surround antagonism, whereby the centre and the surround exhibit responses of opposite polarity. This organization is thought to enhance visual acuity, but whether and how such antagonism plays a role in more complex processing remains poorly understood. Here, we investigate the role of centre and surround receptive fields in retinal direction selectivity by exposing posterior‐preferring On–Off direction‐selective ganglion cells (pDSGCs) to adaptive light and recording their response to globally moving objects. We reveal that light adaptation leads to surround expansion in pDSGCs. The pDSGCs maintain their original directional tuning in the centre receptive field, but present the oppositely tuned response in their surround. Notably, although inhibition is the main substrate for retinal direction selectivity, we found that following light adaptation, both the centre‐ and surround‐mediated responses originate from directionally tuned excitatory inputs. Multi‐electrode array recordings show similar oppositely tuned responses in other DSGC subtypes. Together, these data attribute a new role for excitation in the direction‐selective circuit. This excitation carries an antagonistic centre–surround property, possibly designed to sharpen the detection of motion direction in the retina. Key points: Receptive fields of direction‐selective retinal ganglion cells expand asymmetrically following light adaptation.The increase in the surround receptive field generates a delayed spiking phase that is tuned to the null direction and is mediated by excitation.Following light adaptation, excitation rules the computation in the centre receptive field and is tuned to the preferred direction.GABAergic and glycinergic inputs modulate the null‐tuned delayed response differentially.Null‐tuned delayed spiking phases can be detected in all types of direction‐selective retinal ganglion cells.Light adaptation exposes a hidden directional excitation in the circuit, which is tuned to opposite directions in the centre and surround receptive fields. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

24. Cross-scale information enhancement for object detection.

Author: Li, Tie-jun and Zhao, Hui-feng
Subjects: MULTISCALE modeling, PROBLEM solving, DETECTORS, INFORMATION design
Abstract: Object detection usually adopts multi-scale fusion to enrich the information of the object, and the Feature Pyramid Network (FPN) is a common method for multi-scale fusion. However, traditional fusion methods such as FPN cause information loss when fusing high-level feature maps with low-level feature maps. To solve these problems, we propose a simple but effective cross-scale fusion method that fully uses the information of multi-scale feature maps. In addition, to better utilize the multi-scale contextual information, we designed the Selective Information Enhancement (SIE) module. The SIE dynamically selects information at more important scales for objects of different size and fuse the selected information with feature maps for information enhancement. Apply our method to Single Shot Multibox Detector (SSD) and propose a Cross-Scale Information Enhancement Single Shot Multibox Detector (CESSD). The CESSD improves the object detection capability of SSD models by fusing multi-scale features and selectively enhancing feature map information. To evaluate the effectiveness of the model, we validated it on the Pascal VOC2007 test set for 300 × 300 inputs, and the mean Average Precision (mAP) of CESSD reached 79.8%. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

25. 7 - The Somatosensory System

Published: 2024
Full Text: View/download PDF

26. Grape clusters detection based on multi-scale feature fusion and augmentation

Author: Jinlin Ma, Silong Xu, Ziping Ma, Hong Fu, and Baobao Lin
Subjects: Grape clusters detection, Multi-scale, Receptive field, Feature fusion, Feature augmentation, Medicine, Science
Abstract: Abstract This paper addresses the challenge of low detection accuracy of grape clusters caused by scale differences, illumination changes, and occlusion in realistic and complex scenes. We propose a multi-scale feature fusion and augmentation YOLOv7 network to enhance the detection accuracy of grape clusters across variable environments. First, we design a Multi-Scale Feature Extraction Module (MSFEM) to enhance feature extraction for small-scale targets. Second, we propose the Receptive Field Augmentation Module (RFAM), which uses dilated convolution to expand the receptive field and enhance the detection accuracy for objects of various scales. Third, we present the Spatial Pyramid Pooling Cross Stage Partial Concatenation Faster (SPPCSPCF) module to fuse multi-scale features, improving accuracy and speeding up model training. Finally, we integrate the Residual Global Attention Mechanism (ResGAM) into the network to better focus on crucial regions and features. Experimental results show that our proposed method achieves a mAP $$_{0.5}$$ 0.5 of 93.29% on the GrappoliV2 dataset, an improvement of 5.39% over YOLOv7. Additionally, our method increases Precision, Recall, and F1 score by 2.83%, 3.49%, and 0.07, respectively. Compared to state-of-the-art detection methods, our approach demonstrates superior detection performance and adaptability to various environments for detecting grape clusters.
Published: 2024
Full Text: View/download PDF

27. Grape clusters detection based on multi-scale feature fusion and augmentation.

Author: Ma, Jinlin, Xu, Silong, Ma, Ziping, Fu, Hong, and Lin, Baobao
Abstract: This paper addresses the challenge of low detection accuracy of grape clusters caused by scale differences, illumination changes, and occlusion in realistic and complex scenes. We propose a multi-scale feature fusion and augmentation YOLOv7 network to enhance the detection accuracy of grape clusters across variable environments. First, we design a Multi-Scale Feature Extraction Module (MSFEM) to enhance feature extraction for small-scale targets. Second, we propose the Receptive Field Augmentation Module (RFAM), which uses dilated convolution to expand the receptive field and enhance the detection accuracy for objects of various scales. Third, we present the Spatial Pyramid Pooling Cross Stage Partial Concatenation Faster (SPPCSPCF) module to fuse multi-scale features, improving accuracy and speeding up model training. Finally, we integrate the Residual Global Attention Mechanism (ResGAM) into the network to better focus on crucial regions and features. Experimental results show that our proposed method achieves a mAP 0.5 of 93.29% on the GrappoliV2 dataset, an improvement of 5.39% over YOLOv7. Additionally, our method increases Precision, Recall, and F1 score by 2.83%, 3.49%, and 0.07, respectively. Compared to state-of-the-art detection methods, our approach demonstrates superior detection performance and adaptability to various environments for detecting grape clusters. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

28. Stage-by-Stage Adaptive Alignment Mechanism for Object Detection in Aerial Images.

Author: Zhu, Jiangang, Jing, Donglin, and Gao, Dapeng
Subjects: REMOTE sensing, DYNAMIC models, DETECTORS, ANCHORS, CLASSIFICATION
Abstract: Object detection in aerial images has had a broader range of applications in the past few years. Unlike the targets in the images of horizontal shooting, targets in aerial photos generally have arbitrary orientation, multi-scale, and a high aspect ratio. Existing methods often employ a classification backbone network to extract translation-equivariant features (TEFs) and utilize many predefined anchors to handle objects with diverse appearance variations. However, they encounter misalignment at three levels, spatial, feature, and task, during different detection stages. In this study, we propose a model called the Staged Adaptive Alignment Detector (SAADet) to solve these challenges. This method utilizes a Spatial Selection Adaptive Network (SSANet) to achieve spatial alignment of the convolution receptive field to the scale of the object by using a convolution sequence with an increasing dilation rate to capture the spatial context information of different ranges and evaluating this information through model dynamic weighting. After correcting the preset horizontal anchor to an oriented anchor, feature alignment is achieved through the alignment convolution guided by oriented anchor to align the backbone features with the object's orientation. The decoupling of features using the Active Rotating Filter is performed to mitigate inconsistencies due to the sharing of backbone features in regression and classification tasks to accomplish task alignment. The experimental results show that SAADet achieves equilibrium in speed and accuracy on two aerial image datasets, HRSC2016 and UCAS-AOD. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

29. MRI-Based Brain Tumor Classification Using a Dilated Parallel Deep Convolutional Neural Network.

Author: Rahman, Takowa, Islam, Md Saiful, and Uddin, Jia
Subjects: BRAIN tumors, CONVOLUTIONAL neural networks, MACHINE learning, DATA analysis, ACCURACY
Abstract: Brain tumors are frequently classified with high accuracy using convolutional neural networks (CNNs) to better comprehend the spatial connections among pixels in complex pictures. Due to their tiny receptive fields, the majority of deep convolutional neural network (DCNN)-based techniques overfit and are unable to extract global context information from more significant regions. While dilated convolution retains data resolution at the output layer and increases the receptive field without adding computation, stacking several dilated convolutions has the drawback of producing a grid effect. This research suggests a dilated parallel deep convolutional neural network (PDCNN) architecture that preserves a wide receptive field in order to handle gridding artifacts and extract both coarse and fine features from the images. This article applies multiple preprocessing strategies to the input MRI images used to train the model. By contrasting various dilation rates, the global path uses a low dilation rate (2,1,1), while the local path uses a high dilation rate (4,2,1) for decremental even numbers to tackle gridding artifacts and to extract both coarse and fine features from the two parallel paths. Using three different types of MRI datasets, the suggested dilated PDCNN with the average ensemble method performs best. The accuracy achieved for the multiclass Kaggle dataset-III, Figshare dataset-II, and binary tumor identification dataset-I is 98.35%, 98.13%, and 98.67%, respectively. In comparison to state-of-the-art techniques, the suggested structure improves results by extracting both fine and coarse features, making it efficient. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

30. SmokeFireNet: A Lightweight Network for Joint Detection of Forest Fire and Smoke.

Author: Chen, Yi and Wang, Fang
Subjects: FOREST fires, EXTREME weather, FOREST protection, POLLUTION, FOREST microclimatology
Abstract: In recent years, forest fires have been occurring frequently around the globe, affected by extreme weather and dry climate, causing serious economic losses and environmental pollution. In this context, timely detection of forest fire smoke is crucial for realizing real-time early warning of fires. However, fire and smoke from forest fires can spread to cover large areas and may affect distant areas. In this paper, a lightweight joint forest fire and smoke detection network, SmokeFireNet, is proposed, which employs ShuffleNetV2 as the backbone for efficient feature extraction, effectively addressing the computational efficiency challenges of traditional methods. To integrate multi-scale information and enhance the semantic feature extraction capability, a feature pyramid network (FPN) and path aggregation network (PAN) are introduced in this paper. In addition, the FPN network is optimized by a lightweight DySample upsampling operator. The model also incorporates efficient channel attention (ECA), which can pay more attention to the detection of forest fires and smoke regions while suppressing irrelevant features. Finally, by embedding the receptive field block (RFB), the model further improves its ability to understand contextual information and capture detailed features of fire and smoke, thus improving the overall detection accuracy. The experimental results show that SmokeFireNet is better than other mainstream target detection algorithms in terms of average APall of 86.2%, FPS of 114, and GFLOPs of 8.4, and provides effective technical support for forest fire prevention work in terms of average precision, frame rate, and computational complexity. In the future, the SmokeFireNet model is expected to play a greater role in the field of forest fire prevention and make a greater contribution to the protection of forest resources and the ecological environment. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

31. MS-HRNet: multi-scale high-resolution network for human pose estimation.

Author: Wang, Yanxia, Wang, Renjie, Shi, Hu, and Liu, Dan
Subjects: *POSE estimation (Computer vision), *PARKINSON'S disease, *PARAMETERIZATION, *AUTISTIC children, *AUTISM in children, *HUMAN-computer interaction, *DEEP learning
Abstract: Human pose estimation has important applications in medical diagnosis (such as early diagnosis of autism in children and assisting with the diagnosis of Parkinson's disease), human-computer interaction, animation, and other fields. Currently, many human pose estimation algorithms are based on deep learning. However, most research focuses only on increasing the depth and width of the network model. This approach overlooks that merely enlarging the network's depth and width results in excessive parameterization, without enhancing the model's effective receptive field or its ability to extract multi-scale features. Hence, this paper constructs a network model, named MS-HRNet (Multi-Scale High-Resolution Network), for human pose estimation. Specifically, we propose a more concise and efficient version of HRNet framework as the backbone network of MS-HRNet. This addresses the challenges of HRNet complex structure and large number of parameters that cause training difficulties, and its inadequacy in handling multi-scale information. Additionally, we designed a multi-scale convolutional kernel parallel module named MSBlock (Multi-Scale Block) as the basic block of MS-HRNet. By introducing coordinate attention modules and ASFF (Adaptive Spatial Feature Fusion) modules, the model's ability to extract information is effectively increased, and the issue of feature conflict during the fusion of features with different resolutions is resolved, with only a small increase in the number of model parameters. To evaluate the effectiveness of the proposed model, we conducted comparison experiment and ablation experiments using popular human pose estimation datasets, including COCO2017 and MPII, against multiple existing human pose estimation models.On the COCO 2017 dataset, the number of MS-HRNet parameters are decreased by 41% than the baseline model HRNet, the computational complexity by 59%, and the detection accuracies(mAP) are increased by 2.4 point. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

32. 改进YOLOX的夜间安全帽检测算法.

Author: 韩贵金, 王瑞萱, 徐午言, and 李君
Abstract: Copyright of Journal of Computer Engineering & Applications is the property of Beijing Journal of Computer Engineering & Applications Journal Co Ltd. and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2024
Full Text: View/download PDF

33. 基于深度学习方法的传送带缺陷检测.

Author: 钟信 and 彭力
Abstract: Copyright of Computer Measurement & Control is the property of Magazine Agency of Computer Measurement & Control and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2024
Full Text: View/download PDF

34. A Lightweight CER-YOLOv5s Algorithm for Detection of Construction Vehicles at Power Transmission Lines.

Author: Yu, Pingping, Yan, Yuting, Tang, Xinliang, Shang, Yan, and Su, He
Subjects: ELECTRIC lines, FEATURE extraction, PYRAMIDS, ALGORITHMS
Abstract: In the context of power-line scenarios characterized by complex backgrounds and diverse scales and shapes of targets, and addressing issues such as large model parameter sizes, insufficient feature extraction, and the susceptibility to missing small targets in engineering-vehicle detection tasks, a lightweight detection algorithm termed CER-YOLOv5s is firstly proposed. The C3 module was restructured by embedding a lightweight Ghost bottleneck structure and convolutional attention module, enhancing the model's ability to extract key features while reducing computational costs. Secondly, an E-BiFPN feature pyramid network is proposed, utilizing channel attention mechanisms to effectively suppress background noise and enhance the model's focus on important regions. Bidirectional connections were introduced to optimize the feature fusion paths, improving the efficiency of multi-scale feature fusion. At the same time, in the feature fusion part, an ERM (enhanced receptive module) was added to expand the receptive field of shallow feature maps through multiple convolution repetitions, enhancing the global information perception capability in relation to small targets. Lastly, a Soft-DIoU-NMS suppression algorithm is proposed to improve the candidate box selection mechanism, addressing the issue of suboptimal detection of occluded targets. The experimental results indicated that compared with the baseline YOLOv5s algorithm, the improved algorithm reduced parameters and computations by 27.8% and 31.9%, respectively. The mean average precision (mAP) increased by 2.9%, reaching 98.3%. This improvement surpasses recent mainstream algorithms and suggests stronger robustness across various scenarios. The algorithm meets the lightweight requirements for embedded devices in power-line scenarios. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

35. Receptive Field Space for Point Cloud Analysis.

Author: Jiang, Zhongbin, Tao, Hai, and Liu, Ye
Subjects: *POINT cloud, *CONVOLUTIONAL neural networks, *IMAGE processing
Abstract: Similar to convolutional neural networks for image processing, existing analysis methods for 3D point clouds often require the designation of a local neighborhood to describe the local features of the point cloud. This local neighborhood is typically manually specified, which makes it impossible for the network to dynamically adjust the receptive field's range. If the range is too large, it tends to overlook local details, and if it is too small, it cannot establish global dependencies. To address this issue, we introduce in this paper a new concept: receptive field space (RFS). With a minor computational cost, we extract features from multiple consecutive receptive field ranges to form this new receptive field space. On this basis, we further propose a receptive field space attention mechanism, enabling the network to adaptively select the most effective receptive field range from RFS, thus equipping the network with the ability to adjust granularity adaptively. Our approach achieved state-of-the-art performance in both point cloud classification, with an overall accuracy (OA) of 94.2%, and part segmentation, achieving an mIoU of 86.0%, demonstrating the effectiveness of our method. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

36. Multi-scale context fusion network for melanoma segmentation.

Author: Zhenhua Li and Lei Zhang
Abstract: Aiming at the problems that the edge of melanoma image is fuzzy, the contrast with the background is low, and the hair occlusion makes it difficult to segment accurately, this paper proposes a model MSCNet for melanoma segmentation based on U-net frame. Firstly, a multi-scale pyramid fusion module is designed to reconstruct the skip connection and transmit global information to the decoder. Secondly, the contextural information conduction module is innovatively added to the top of the encoder. The module provides different receptive fields for the segmented target by using the hole convolution with different expansion rates, so as to better fuse multi-scale contextural information. In addition, in order to suppress redundant information in the input image and pay more attention to melanoma feature information, global channel attention mechanism is introduced into the decoder. Finally, In order to solve the problem of lesion class imbalance, this paper uses a combined loss function. The algorithm of this paper is verified on ISIC 2017 and ISIC 2018 public datasets. The experimental results indicate that the proposed algorithm has better accuracy for melanoma segmentation compared with other CNN-based image segmentation algorithms. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

37. YOLO-BS: A Better Object Detection Model for Real-Time Driver Behavior Detection

Author: Xi, Yang, Guo, Jinxin, Ma, Ming, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Huang, De-Shuang, editor, Pan, Yijie, editor, and Guo, Jiayang, editor
Published: 2024
Full Text: View/download PDF

38. A Novel Facial Expression Recognition (FER) Model Using Multi-scale Attention Network

Author: Ghadai, Chakrapani, Patra, Dipti, Okade, Manish, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Kaur, Harkeerat, editor, Jakhetiya, Vinit, editor, Goyal, Puneet, editor, Khanna, Pritee, editor, Raman, Balasubramanian, editor, and Kumar, Sanjeev, editor
Published: 2024
Full Text: View/download PDF

39. FCGAN: Spectral Convolutions via FFT for Channel-Wide Receptive Field in Generative Adversarial Networks

Author: Gomes, Pedro H. B., Santos, Luiz Fernando, Gattass, Marcelo, Rannenberg, Kai, Editor-in-Chief, Soares Barbosa, Luís, Editorial Board Member, Carette, Jacques, Editorial Board Member, Tatnall, Arthur, Editorial Board Member, Neuhold, Erich J., Editorial Board Member, Stiller, Burkhard, Editorial Board Member, Stettner, Lukasz, Editorial Board Member, Pries-Heje, Jan, Editorial Board Member, Kreps, David, Editorial Board Member, Rettberg, Achim, Editorial Board Member, Furnell, Steven, Editorial Board Member, Mercier-Laurent, Eunika, Editorial Board Member, Winckler, Marco, Editorial Board Member, Malaka, Rainer, Editorial Board Member, Maglogiannis, Ilias, editor, Iliadis, Lazaros, editor, Macintyre, John, editor, Avlonitis, Markos, editor, and Papaleonidas, Antonios, editor
Published: 2024
Full Text: View/download PDF

40. Perceptive Fields and the Study of Inherited Retinal Degeneration

Author: Rizzi, Matteo, Powell, Kate, Singh, Arun D., Series Editor, Prakash, Gyan, editor, and Iwata, Takeshi, editor
Published: 2024
Full Text: View/download PDF

41. SAMDConv: Spatially Adaptive Multi-scale Dilated Convolution

Author: Hu, Haigen, Yu, Chenghan, Zhou, Qianwei, Guan, Qiu, Chen, Qi, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Liu, Qingshan, editor, Wang, Hanzi, editor, Ma, Zhanyu, editor, Zheng, Weishi, editor, Zha, Hongbin, editor, Chen, Xilin, editor, Wang, Liang, editor, and Ji, Rongrong, editor
Published: 2024
Full Text: View/download PDF

42. TRFN: Triple-Receptive-Field Network for Regional-Texture and Holistic-Structure Image Inpainting

Author: Xiao, Qingguo, Han, Zhiyuan, Liu, Zhaodong, Pan, Guangyuan, Zheng, Yanpeng, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Luo, Biao, editor, Cheng, Long, editor, Wu, Zheng-Guang, editor, Li, Hongyi, editor, and Li, Chaojie, editor
Published: 2024
Full Text: View/download PDF

43. Knowledge Distillation via Information Matching

Author: Zhu, Honglin, Jiang, Ning, Tang, Jialiang, Huang, Xinlei, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Luo, Biao, editor, Cheng, Long, editor, Wu, Zheng-Guang, editor, Li, Hongyi, editor, and Li, Chaojie, editor
Published: 2024
Full Text: View/download PDF

44. Learning delays through gradients and structure: emergence of spatiotemporal patterns in spiking neural networks

Author: Balázs Mészáros, James C. Knight, and Thomas Nowotny
Subjects: spiking neural network, delay learning, dynamic pruning, receptive field, sparse connectivity, Neurosciences. Biological psychiatry. Neuropsychiatry, RC321-571
Abstract: We present a Spiking Neural Network (SNN) model that incorporates learnable synaptic delays through two approaches: per-synapse delay learning via Dilated Convolutions with Learnable Spacings (DCLS) and a dynamic pruning strategy that also serves as a form of delay learning. In the latter approach, the network dynamically selects and prunes connections, optimizing the delays in sparse connectivity settings. We evaluate both approaches on the Raw Heidelberg Digits keyword spotting benchmark using Backpropagation Through Time with surrogate gradients. Our analysis of the spatio-temporal structure of synaptic interactions reveals that, after training, excitation and inhibition group together in space and time. Notably, the dynamic pruning approach, which employs DEEP R for connection removal and RigL for reconnection, not only preserves these spatio-temporal patterns but outperforms per-synapse delay learning in sparse networks. Our results demonstrate the potential of combining delay learning with dynamic pruning to develop efficient SNN models for temporal data processing. Moreover, the preservation of spatio-temporal dynamics throughout pruning and rewiring highlights the robustness of these features, providing a solid foundation for future neuromorphic computing applications.
Published: 2024
Full Text: View/download PDF

45. Alpha-2 nicotinic acetylcholine receptors regulate spectral integration in auditory cortex

Author: Irakli Intskirveli, Susan Gil, Ronit Lazar, and Raju Metherate
Subjects: nicotine, mouse, receptive field, electrophysiology, current-source density, neuromodulation, Neurosciences. Biological psychiatry. Neuropsychiatry, RC321-571
Abstract: IntroductionIn primary auditory cortex (A1), nicotinic acetylcholine receptors (nAChRs) containing α2 subunits are expressed in layer 5 Martinotti cells (MCs)—inhibitory interneurons that send a main axon to superficial layers to inhibit distal apical dendrites of pyramidal cells (PCs). MCs also contact interneurons in supragranular layers that, in turn, inhibit PCs. Thus, MCs may regulate PCs via inhibition and disinhibition, respectively, of distal and proximal apical dendrites. Auditory inputs to PCs include thalamocortical inputs to middle layers relaying information about characteristic frequency (CF) and near-CF stimuli, and intracortical long-distance (“horizontal”) projections to multiple layers carrying information about spectrally distant (“nonCF”) stimuli. CF and nonCF inputs integrate to create broad frequency receptive fields (RFs). Systemic administration of nicotine activates nAChRs to “sharpen” RFs—to increase gain within a narrowed RF—resulting in enhanced responses to CF stimuli and reduced responses to nonCF stimuli. While nicotinic mechanisms to increase gain have been identified, the mechanism underlying RF narrowing is unknown.MethodsHere, we examine the role of α2 nAChRs in mice with α2 nAChR-expressing neurons labeled fluorescently, and in mice with α2 nAChRs genetically deleted.ResultsThe distribution of fluorescent neurons in auditory cortex was consistent with previous studies demonstrating α2 nAChRs in layer 5 MCs, including nonpyramidal somata in layer 5 and dense processes in layer 1. We also observed label in subcortical auditory regions, including processes, but no somata, in the medial geniculate body, and both fibers and somata in the inferior colliculus. Using electrophysiological (current-source density) recordings in α2 nAChR knock-out mice, we found that systemic nicotine failed to enhance CF-evoked inputs to layer 4, suggesting a role for subcortical α2 nAChRs, and failed to reduce nonCF-evoked responses, suggesting that α2 nAChRs regulate horizontal projections to produce RF narrowing.DiscussionThe results support the hypothesis that α2 nAChRs function to simultaneously enhance RF gain and narrow RF breadth in A1. Notably, a similar neural circuit may recur throughout cortex and hippocampus, suggesting widespread conserved functions regulated by α2 nAChRs.
Published: 2024
Full Text: View/download PDF

46. MRI-Based Brain Tumor Classification Using a Dilated Parallel Deep Convolutional Neural Network

Author: Takowa Rahman, Md Saiful Islam, and Jia Uddin
Subjects: brain tumor classification, data augmentation, grid effect, multiscale dilated parallel convolution, machine learning classifiers, receptive field, Electronic computers. Computer science, QA75.5-76.95
Abstract: Brain tumors are frequently classified with high accuracy using convolutional neural networks (CNNs) to better comprehend the spatial connections among pixels in complex pictures. Due to their tiny receptive fields, the majority of deep convolutional neural network (DCNN)-based techniques overfit and are unable to extract global context information from more significant regions. While dilated convolution retains data resolution at the output layer and increases the receptive field without adding computation, stacking several dilated convolutions has the drawback of producing a grid effect. This research suggests a dilated parallel deep convolutional neural network (PDCNN) architecture that preserves a wide receptive field in order to handle gridding artifacts and extract both coarse and fine features from the images. This article applies multiple preprocessing strategies to the input MRI images used to train the model. By contrasting various dilation rates, the global path uses a low dilation rate (2,1,1), while the local path uses a high dilation rate (4,2,1) for decremental even numbers to tackle gridding artifacts and to extract both coarse and fine features from the two parallel paths. Using three different types of MRI datasets, the suggested dilated PDCNN with the average ensemble method performs best. The accuracy achieved for the multiclass Kaggle dataset-III, Figshare dataset-II, and binary tumor identification dataset-I is 98.35%, 98.13%, and 98.67%, respectively. In comparison to state-of-the-art techniques, the suggested structure improves results by extracting both fine and coarse features, making it efficient.
Published: 2024
Full Text: View/download PDF

47. Stable 3D Deep Convolutional Autoencoder Method for Ultrasonic Testing of Defects in Polymer Composites.

Author: Liu, Yi, Yu, Qing, Liu, Kaixin, Zhu, Ningtao, and Yao, Yuan
Subjects: *POLYMER testing, *ULTRASONIC imaging, *SURFACE defects, *ULTRASONIC testing, *ECHO
Abstract: Ultrasonic testing is widely used for defect detection in polymer composites owing to advantages such as fast processing speed, simple operation, high reliability, and real-time monitoring. However, defect information in ultrasound images is not easily detectable because of the influence of ultrasound echoes and noise. In this study, a stable three-dimensional deep convolutional autoencoder (3D-DCA) was developed to identify defects in polymer composites. Through 3D convolutional operations, it can synchronously learn the spatiotemporal properties of the data volume. Subsequently, the depth receptive field (RF) of the hidden layer in the autoencoder maps the defect information to the original depth location, thereby mitigating the effects of the defect surface and bottom echoes. In addition, a dual-layer encoder was designed to improve the hidden layer visualization results. Consequently, the size, shape, and depth of the defects can be accurately determined. The feasibility of the method was demonstrated through its application to defect detection in carbon-fiber-reinforced polymers. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

48. 基于优化感受野策略的图像修复方法.

Author: 刘恩泽, 刘华明, 王秀友, and 毕学慧
Abstract: The currently popular image inpainting methods based on deep neural network typically employ large receptive field feature extractors. However, when restoring local patterns and textures, they often generate artifacts or distorted textures, thus failing to recover the overall semantic and visual structure of the image. To address this issue, this paper proposed a novel image inpainting method, called ORFNet, which combined coarse and fine inpainting by employing an optimized receptive field strategy. Initially, it obtained a coarse inpainting result by using a generative adversarial network with a large receptive field. Subsequently, it used a model with a small receptive field to refine local texture details. Finally, it performed a global refinement inpainting by using an encoder-decoder network based on attention mechanisms. Validation on the CelebA, Paris StreetView, and Places2 datasets demonstrates that ORFNet outperforms existing representative inpainting methods. It leads to 1.98 dB increase in PSNR and 2.49% improvement in SSIM, along with average 2.4% reduction in LPIPS. Experimental results confirm the effectiveness of the proposed image inpainting method, showcasing superior performance across various receptive field settings and achieving more realistic and natural visual outcome. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

49. Marr's three levels of analysis are useful as a framework for neuroscience.

Author: Lengyel, Máté
Subjects: *ACTION potentials, *NEUROSCIENCES, *DENDRITES, *NEURAL circuitry, *VISION
Published: 2024
Full Text: View/download PDF

50. Optimization of segmentation model based on maximization information fusion and its application in nuclear image analysis.

Author: Xiong, Feiyan and Wei, Yun
Subjects: *IMAGE segmentation, *HEMATOXYLIN & eosin staining, *IMAGE analysis, *IMAGE fusion
Abstract: The Whole Slide Image (WSI) is a pathological image with Hematoxylin & Eosin staining. The low-contrast color staining will bring a challenge on analysis. We propose SNSeg (Staining Nuclear Segmentation) to improve the segmentation performance in WSI, for obtaining accurate nuclear region. At the macro level, we reconstructed the feature fusion mode and connection path, for reducing semantic loss in the gradient descent. At the micro level, first, we design a multiple receptive field convolution unit (RFC), and it can adjust the receptive field for adapting to the nuclei size of the input image. Secondly, for efficiently fusing the feature information extracted from the encoder, we design a multi-branch channel attention fusion unit (MCA), which integrates different branch information flows in channel-wise to a unified module. Finally, we design parallel outputting decoder fusion (DF) module to fuse outputting spatial attention for generating the final segmentation results. In addition, we introduce the watershed based on distance transformation to separate adherent nuclei and mark contours. We design experiments for verifying SNSeg on public datasets of MoNuSeg, TNBC, and PanNuKe. The segmentation results on MoNuSeg show that the SNSeg has achieves an accuracy of 84.32% and a Dice score of 81.21%. Compared with other networks, the SNSeg have competitive advantages in segmentation performance and network parameters. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

15,078 results on '"Receptive field"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources