Where's the Dirt? Why We're Losing Touch with Real Agricultural Data

Following my previous discussion on the limitations of synthetic data in agricultural AI (" Stop Swinging That Hammer: Why Synthetic Data is Banging Up Agricultural AI "), a pertinent question arose: Why do researchers opt for synthetic data over real-world data in the first place? It's a question that warrants careful consideration, particularly given the implications for the validity and applicability of AI in agriculture. One would expect a data-driven approach – the design and execution of field experiments to generate targeted datasets – to be the prevailing methodology. However, this is not consistently observed. Several factors appear to contribute to the preference for synthetic data: Perceived Complexity of Real Data: There exists a perception that the acquisition and preprocessing of real-world agricultural data is excessively challenging. While it is true that real data can be complex and require significant effort, this should not deter rigorous sc...