Summary
The huge set of fluorescence data collected on GFP is best explained using simple statistical models (1). In that paper it was found that multidimensional Gaussian processes will put the vast majority of the variation (>97%) on the first component.
1.
Tonner PD, Pressman A, Ross D. Interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power. Proceedings of the National Academy of Sciences. 2022;119(26). Available from: https://doi.org/10.1073/pnas.2114021119