Binary Classification of Agricultural Crops Using Sentinel Satellite Data and Machine Learning Techniques
Paolo Bertellini, Gianluca D'Addese, Giorgia Franchini, Simone Parisi, Carmelo Scribano, Daniele Zanirato, Marko Bertogna
DOI: http://dx.doi.org/10.15439/2023F1703
Citation: Proceedings of the 18th Conference on Computer Science and Intelligence Systems, M. Ganzha, L. Maciaszek, M. Paprzycki, D. Ślęzak (eds). ACSIS, Vol. 35, pages 859–864 (2023)
Abstract. The advent of high-resolution satellite imagery, such as Sentinel 1 and Sentinel 2, has provided valuable data for various applications, including crop classification. This paper presents a study on the classification of agricultural fields using indices derived from Sentinel satellite imagery. Specifically, we focus on creating binary classifiers capable of distinguishing between different crops, namely Tomatoes, Soy, Sugar Beets, Rice, and Wheat. The paper investigates various preprocessing techniques to create a dataset suitable for machine learning methods, such as Random Forests, which require a fixed number of features. Additionally, we demonstrate that linear interpolation and out-of-scale values have equivalent performance in terms of classification accuracy. Furthermore, we address the issue of imbalanced datasets commonly encountered in agricultural field classification. We explore different balancing techniques that can significantly improve the performance of machine learning methodologies. The motivation for this work stems from the growing interest in Agriculture 4.0, and it serves as a valuable tool to verify farmers' claims, especially in relation to state subsidies for specific crops of interest. Understanding the crop type present in the field represents highly valuable information that can serve as a foundation for subsequent analyses or as input for calibrating models, such as Decision Support Systems. Overall, this study contributes to the field of agricultural research and provides insights into the application of Machine Learning techniques for crop classification using satellite imagery. The findings offer practical implications for monitoring and optimizing agricultural practices in the context of precision farming and sustainable agriculture.
References
- Belgiu, M., Drăguţ, L.: Random forest in remote sensing: A review of applications and future directions. ISPRS journal of photogrammetry and remote sensing 114, 24–31 (2016)
- Breiman, L.: Random forests. Machine learning 45, 5–32 (2001)
- Campos-Taberner, M., Garcı́a-Haro, F.J., Martı́nez, B., Sánchez-Ruı́z, S., Gilabert, M.A.: A copernicus sentinel-1 and sentinel-2 classification framework for the 2020+ european common agricultural policy: A case study in valència (spain). Agronomy 9(9) (2019). https://doi.org/10.3390/agronomy9090556, https://www.mdpi.com/2073-4395/9/9/556
- Chapelle, O., Haffner, P., Vapnik, V.N.: Support vector machines for histogram-based image classification. IEEE transactions on Neural Networks 10(5), 1055–1064 (1999)
- Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: Smote: synthetic minority over-sampling technique. Journal of artificial intelligence research 16, 321–357 (2002)
- Drummond, C., Holte, R.C., et al.: C4. 5, class imbalance, and cost sensitivity: why under-sampling beats over-sampling. In: Workshop on learning from imbalanced datasets II. vol. 11, pp. 1–8 (2003)
- Immitzer, M., Vuolo, F., Atzberger, C.: First experience with sentinel-2 data for crop and tree species classifications in central europe. Remote Sensing 8(3) (2016). https://doi.org/10.3390/rs8030166
- Lepot, M., Aubin, J.B., Clemens, F.H.: Interpolation in time series: An introductive overview of existing methods, their performance criteria and uncertainty assessment. Water 9(10), 796 (2017)
- Nasrallah, A., Baghdadi, N., Mhawej, M., Faour, G., Darwish, T., Belhouchette, H., Darwich, S.: A novel approach for mapping wheat areas using high resolution sentinel-2 images. Sensors 18(7) (2018). https://doi.org/10.3390/s18072089
- Piedelobo, L., Hernández-López, D., Ballesteros, R., Chakhar, A., Del Pozo, S., González-Aguilera, D., Moreno, M.A.: Scalable pixel-based crop classification combining sentinel-2 and landsat-8 data time series: Case study of the duero river basin. Agricultural Systems 171, 36–50 (2019). https://doi.org/https://doi.org/10.1016/j.agsy.2019.01.005
- Wardlow, B., Egbert, S., Kastens, J.: Analysis of time-series modis 250 m vegetation index data for crop classification in the u.s. central great plains. Remote Sensing of Environment 108, 290–310 (06 2007). https://doi.org/10.1016/j.rse.2006.11.021
- Yi, Z., Jia, L., Chen, Q.: Crop classification using multi-temporal sentinel-2 data in the shiyang river basin of china. Remote Sensing 12(24) (2020), https://www.mdpi.com/2072-4292/12/24/4052
- You, N., Dong, J.: Examining earliest identifiable timing of crops using all available sentinel 1/2 imagery and google earth engine. ISPRS Journal of Photogrammetry and Remote Sensing 161, 109–123 (2020). https://doi.org/https://doi.org/10.1016/j.isprsjprs.2020.01.001