CenterFormer: Coupling CenterNet and Vision Transformer for Accurate Wheat Head Detection

JOIG 2025 Vol.13(5):476-488
doi: 10.18178/joig.13.5.476-488

Ekei Harimoto and Xian-Hua Han *

Graduate School of Artificial Intelligence and Science, Rikkyo University, Tokyo, Japan
Email: 24vr032p@rikkyo.ac.jp (E.H.); hanxhua@rikkyo.ac.jp (X.-H.H.)
*Corresponding author

Manuscript received January 17, 2025; revised April 7, 2025; accepted May 26, 2025; published September 17, 2025.

Abstract—Wheat is a staple crop cultivated widely across the world, making effective management of wheat fields a critical task. A key component of this management is accurately identifying and counting wheat heads, which provides essential data for assessing growth conditions, estimating crop yields, and optimizing agricultural practices. This study introduces CenterFormer, a novel approach to automatic wheat head detection that represents each wheat head as a single point, avoiding the ambiguous annotation of dense objects, and leverages the long-range dependency modeling of the Transformer architecture to learn multi-scale features for head prediction. Specifically, we employ a hierarchical Transformer backbone that applies self-attention in both the spatial and channel domains to extract multi-scale features across its hierarchical stages. To maintain the linear complexity of the Transformer block, we implement window-based self-attention in the spatial domain and group-wise self-attention in the channel direction. In addition, to exploit multi-scale features that carry both detailed spatial information and abstract semantic context, we design a simple yet effective fusion block that integrates them for enhanced wheat head prediction. The prediction block estimates a heat map giving the probability that each point lies at the center of a wheat head, and regresses other object properties, such as size and sub-pixel offset, for each center location. Extensive experiments on the Global Wheat Head Detection (GWHD) dataset demonstrate that the proposed method achieves substantial performance improvements over state-of-the-art object detection models.

Keywords—wheat head detection, transformer, self-attention, multi-scale feature fusion, hierarchical architecture, center point, CenterNet
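
The prediction step summarized in the abstract (a center heat map plus per-location size and sub-pixel offset) can be illustrated with a minimal CenterNet-style decoding sketch. This is not the authors' implementation: the function name decode_centers, the score threshold, the output stride of 4, the 3x3 local-maximum test, and the convention that sizes are given in input-image pixels are all assumptions made for illustration.

```python
from typing import List, Tuple

Grid = List[List[float]]
Box = Tuple[float, float, float, float, float]  # x1, y1, x2, y2, score


def decode_centers(heat: Grid, size_w: Grid, size_h: Grid,
                   off_x: Grid, off_y: Grid,
                   threshold: float = 0.3, stride: int = 4) -> List[Box]:
    """Turn center-point predictions into boxes (hypothetical sketch).

    Assumptions: all grids share one shape, sizes are in input-image
    pixels, and offsets are fractions of one heat-map cell.
    """
    rows, cols = len(heat), len(heat[0])
    boxes: List[Box] = []
    for r in range(rows):
        for c in range(cols):
            score = heat[r][c]
            if score < threshold:
                continue
            # Keep only local maxima of the 3x3 neighbourhood; this
            # stands in for non-maximum suppression.
            neighbours = [heat[rr][cc]
                          for rr in range(max(0, r - 1), min(rows, r + 2))
                          for cc in range(max(0, c - 1), min(cols, c + 2))]
            if score < max(neighbours):
                continue
            # Refine the integer cell position with the sub-pixel offset,
            # then map it back to input-image coordinates.
            cx = (c + off_x[r][c]) * stride
            cy = (r + off_y[r][c]) * stride
            w, h = size_w[r][c], size_h[r][c]
            boxes.append((cx - w / 2, cy - h / 2,
                          cx + w / 2, cy + h / 2, score))
    # Highest-scoring detections first.
    return sorted(boxes, key=lambda b: b[4], reverse=True)


# Toy 4x4 prediction: one strong peak and a weaker adjacent response.
heat = [[0.0] * 4 for _ in range(4)]
heat[1][2] = 0.9
heat[1][1] = 0.5  # above threshold, but not a local maximum
size_w = [[20.0] * 4 for _ in range(4)]
size_h = [[10.0] * 4 for _ in range(4)]
off_x = [[0.5] * 4 for _ in range(4)]
off_y = [[0.25] * 4 for _ in range(4)]

boxes = decode_centers(heat, size_w, size_h, off_x, off_y)
print(boxes)
```

In this toy case the weaker response at cell (1, 1) is suppressed by its stronger neighbour, so a single box is returned, and the wheat head count is simply the number of returned boxes. Because each object is a single peak, no box-overlap suppression is needed, which is the property that makes the center-point formulation attractive for densely packed heads.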

Cite: Ekei Harimoto and Xian-Hua Han, "CenterFormer: Coupling CenterNet and Vision Transformer for Accurate Wheat Head Detection," Journal of Image and Graphics, Vol. 13, No. 5, pp. 476-488, 2025.

Copyright © 2025 by the authors. This is an open access article distributed under the Creative Commons Attribution License (CC-BY-4.0), which permits use, distribution and reproduction in any medium, provided that the article is properly cited, the use is non-commercial and no modifications or adaptations are made.
