BII-Implementation: The causes and consequences of plant biodiversity across scales in a rapidly changing world

The proposed Biology Integration Institute will bring together two major research institutions in the Upper Midwest—the University of Minnesota (UMN) and University of Wisconsin-Madison (UW)—to investigate the causes and consequences of plant biodiversity across scales in a rapidly changing world—from genes and molecules within cells and tissues to communities, ecosystems, landscapes and the biosphere. The Institute focuses on plant biodiversity, defined broadly to encompass the heterogeneity within life that occurs from the smallest to the largest biological scales. A premise of the Institute is ‡ ‡ § | ‡


consequences of changes in plant biodiversity across scales in a rapidly changing world?
In pursuing this question, we will train the next generation of diverse integrative biologists, including members of the Native American community, and engage K-12 audiences about the nature and consequences of Earth's biodiversity.
Decades of research on biodiversity and its relationship to ecosystem functions have revealed that variation in life on Earth matters for how ecosystems cycle elements, produce biomass, and respond to environmental change. Integration across biological scales is required to address fundamental questions that remain poorly understood including: 1) why and how life's diversity matters at the largest scales-for biosphere function and dynamics critical to maintaining Earth's life support systems-and 2) how and why variation at the smallest scales-genes and molecules within cells and tissues-influences processes at all other scales. The scientific advances required to tackle this set of problems have been hindered by the fragmentation of biology into specialized sub-disciplines that do not meaningfully connect these vastly different scales. Here we define the term 'biodiversity' not simply in its most common usage as species diversity but as a concept that also encompasses the variation in functional and evolutionary components within and among biological scales. As such our institute will address diversity at scales ranging from genomes to individuals and clades as well as from communities to ecosystems and the global biosphere.
One component of diversity is spectral variation, which results from chemical, anatomical, morphological, and architectural plant traits, that themselves may vary due to selection, evolutionary history, community composition and diversity, and environmental drivers. Spectral signatures can thus distinguish among different kinds of molecules in plants, as well as reveal variation across the range of scales we will study here: among leaves of individual plants, within and among species across the tree of life, and within and among plant communities, ecosystems, and landscapes across the global biosphere. 'Spectral biology' is thus a powerful and integrated means to capture biological variation-or biodiversity-across scales and to determine the causal factors that give rise to that variation , Laliberté et al. 2020). More profoundly, spectral biology can also be used to make observations about the variation in plant functions at many biological scales that influence Earth's life support systems. As such, it is a revolutionary approach that contains the potential to advance predictions and models about vegetation function and its variation at successive scales ( Fig. 1)-up to and including the global biosphere-as well as how biodiversity and function respond to a changing environment.

Our primary mission is to uncover how the biological variation (i.e., diversity) and the processes that generate it at every scale influence processes and emergent properties at the next scale-ultimately to explain ecosystem and biosphere dynamics-and how changes in these global processes, in turn, drive biological variation at all other scales.
Conceptual Framework-Our overarching conceptual framework (Fig. 1) and approach to scaling across levels of biological variation, space and time envisions nested sets of biological systems (for simplicity shown in a sequence rather than nested in Fig. 1), where the emergent properties at one level are a function of the biological components and their interactive processes at the scale below. We posit that there are discernable rules explained by process-oriented theories that govern how system components give rise to emergent properties at each scale. Within this conceptual framework, we plan a series of studies within five project Themes to examine the interaction of components of variation (biodiversity) at one biological scale with environmental change at multiple spatial and temporal scales (Fig. 1). The project Themes we propose provide a framework that incorporates existing theories appropriate to each scale or scale transition using spectral biology, experiments, observations and models (Table 1).  Conceptual framework (left), showing the five Themes of the Institute (1-5) as the transitions between biological scales. Each transition involves the interaction of biodiversity components (genes, leaves, species, communities, ecosystems) with environmental variation (global change forces) at different temporal and spatial scales resulting in emergent properties, which are the basis of biological variation at other scales. Three different concepts of biological scale (physiological, evolutionary, and ecological) are incorporated, as detailed in Fig. 2. The processes (right) by which smaller scale components generate emergent properties can be modeled based on state-of-the-art theories and observations in a suite of empirical systems, through an iterative process that advances theory, detection methods and the (re)design of empirical approaches.

Core study themes that span biological scales
BII-Implementation: The causes and consequences of plant biodiversity across ... Theme 3 How functionally and spectrally distinct taxa, sampled across the tree of life, interact locally, leading to the assembly and dynamics of communities at multiple spatial scales, under current and future environmental conditions Theme 4 The consequences of biodiversity for ecosystem functioning and its response to global change Theme 5 How to improve parameterization of tissue-to ecosystem-scale properties at various spatial scales and advance land surface models that incorporate plant functional diversity In our conceptual framework (Fig. 1), we use three different conceptions of biological scale (Fig. 2) to integrate physiological, evolutionary and ecological processes under five research Themes. These processes (Fig. 2) cannot be positioned along a single linear scaling axis (Research Activities, Fig. 4), but instead are often considered in overlapping or sometimes orthogonal ways. For example, the variation in function within a species is a consequence of the variation in genetic architecture as it interacts with environmental variation (Theme 1). The diversity of species and their functional strategies emerge from evolution acting on functional variation as the processes of speciation and extinction unfold (Theme 2). The evolution of distinct species and lineages, in turn, provides species pools of functionally and spectrally distinct taxa which-through interactions with environmental variation-determine the neighborhood interactions that drive community assembly dynamics, and as a result, the functional and spectral composition of communities (Theme 3). The energy flow and material cycling properties of ecosystems emerge from environmental variation and collective species' traits (Theme 4). For example, the plant canopy, comprising the different species in the community, determines the light interception and energy and material fluxes from biosphere to atmosphere per unit area and is an emergent property of the community that results from the traits of individual plants and their interactions with each other and the environment (Ollinger et al. 2008, Reich 2012). The Three kinds of biological hierarchies that form the basis of biological integration and scaling. A) Physiological: Hierarchy of functional or metabolic units within a plant from genes, metabolites (small molecules) to organelles, cells, leaves to the whole plant. B) Evolutionary: Hierarchical organization of the tree of life from individuals to lineages. C) Ecological: Nested spatial scales showing that ecological and biogeographic processes-from density-and frequencydependent neighborhood interactions to environmental sorting, dispersal and biogeographic processes-that drive the distribution and diversity of life and shift with increasing spatial and temporal scales.
biosphere and its dynamics in turn emerge from the variation among ecosystems across the globe and their interactions with environmental variation and global change (Theme 5).
We consider emergent properties as observable phenomena that represent aggregate functions of biological systems at each scale of inquiry that result from biological components and their interactions with each other and the environment at the scale below. Emergent properties thus arise from various interactive processes of system components but are more than the sum of the quantifiable parts (Ponge 2005, Reichstein et al. 2014, Bradford et al. 2017). In the proposed Institute we will examine various kinds of emergent properties relevant to each system and scale. We will focus especially on electromagnetic spectral signatures-the patterns of radiant energy (e.g., light) that interacts with vegetation-as one kind of emergent property that can serve as a unifying link across many systems and scales (Homolova et al. 2013, Gamon et al. 2020, Laliberté et al. 2020. Projects both within and spanning these Themes are designed to elucidate the biological components and underlying processes that give rise to the emergent properties of biodiversity-including the emergent spectra-at multiple scales. Projects include common garden studies of model species in which we identify genetic interactions with the environment-including climate, pests, pathogens, and the soil microbiome-and modeling of evolutionary processes within model clades and across the plant tree of life that give rise to functional and spectral trait variation among species (Themes 1 and 2). They also include unparalleled global change experiments (Themes 3 and 4) and NEON-based biodiversity observations (Themes 4 and 5). At the community and ecosystem levels, we experimentally test how species interactions and dispersal lead to biodiversity (taxonomic, phylogenetic, functional, spectral) at multiple spatial scales, with consequences for carbon, water and nutrient cycles under current and future environmental conditions. Finally, at the planetary scale, spectral biology-in concert with machine learning and simulation modeling-provides the potential to represent all of these finer-scale processes within the context of biosphere-atmosphere interactions using models that upscale ecosystem emergent properties and functions to the globe (Theme 5). Collectively, the research Themes and projects within and across them provide a critical platform for training the next generation of integrative biologists. Through graduate student and postdoctoral training, summer training workshops and winter symposia, development of public exhibits and videos tailored to K-12 audiences and the public, and opportunities for international exchange with related programs in Switzerland and Canada, these collective projects provide the opportunity for education and public outreach about life's variation. In doing so, we will engage diverse audiences in new understanding of the connectedness of biodiversity from genetic variation to the tree of life and how its losses affect our human habitat and life support systems. Our highly collaborative and diverse team brings together expertise and skill sets that will enable us to tackle these challenging themes.

Organization and Management Design
The Institute will have a Director and two Co-Directors, responsible for managing the research activities within and across all project Themes (Fig. 3). The Director will take the main administrative role in overseeing the Institute and communicating with NSF. Senior Personnel (Project Leaders) will lead and participate in projects within the research Themes as well as a series of bridging projects that cut across these Themes (Fig. 3). All Project Leaders (including Directors), together with collaborators, postdocs, graduate students and undergraduate researchers, form the Science Team, which will collectively Schematic of the Management Plan, showing management structure and responsibilities of personnel in relation to Research, Education and Training, and Outreach components of the Institute. The Directors will oversee all research, education and training, and outreach activities. Project Leader(s) (shown in bold) will be designated to oversee each of the five thematic research areas along with Project Leaders of specific projects within the thematic area (not in bold). Project Leaders who will oversee bridging and integration activities are shown next to green arrows linking the thematic areas. One Project Leader (shown in bold) will oversee each of the three major outreach activities in collaboration with other Project Leaders and collaborators. All Project Leaders will collaborate in education and training activities. A Managing Director and Co-Managing Director, along with an information management team, will ensure data are managed properly, and will coordinate communication; research, education and training, and outreach activities; the evaluation of those activities; and reporting functions. The External Science Advisory Board contributes to evaluation and provides feedback on research, education and training, as well as outreach. carry out Research, Education and Training, and Outreach activities. Faculty Project Leaders will mentor Institute postdocs and graduate students, incorporate project Themes into teaching activities, and lead and/or participate in outreach activities. U.S. and foreign scientific collaborators will contribute to the Research Activities as outlined elsewhere. The Managing Director and Co-Director will provide efficient management of the institute, overseeing the business and operations of the Institute to ensure fluidity among all aspects from science to operations to evaluation. Specifically, they will coordinate Research, Education and Training, and Outreach activities, oversee reporting responsibilities, monitor progress of the Science Team, contribute to Information Management activities, and facilitate communication among team members.

Leadership and Team
Our highly productive and collaborative team spans biological sub-disciplines within evolution and physiological and ecosystem ecology with expertise in genetics, '-omics' approaches, bioinformatics, global change biology, spectroscopy, and modeling. We conduct predictive modeling in all of these domains. Collectively our team has forged innovative and integrative research in plant biology that has transformed our understanding of plant hierarchies, the interconnection of ecological and evolutionary processes in community and global change ecology and the role of spectral biology in bridging biological scales to understand patterns and consequences of biodiversity. Director Cavender-Bares is an integrative physiological and evolutionary ecologist and a leader in harnessing phylogenetic and spectroscopic methods in ecology. She previously led a NASA/NSF Dimensions of Biodiversity grant with Townsend, Hobbie, Meireles, Reich, and Williams to use spectral biology to link genetic, phylogenetic, and functional dimensions of biodiversity above and belowground across scales , Gholizadeh et al. 2019, Madritch et al. 2014 She also led a related NIMBios working group with Meireles and Townsend. Jointly these efforts built a community of collaborative researchers and led to a series of transformative publications and an edited book that provide a critical foundation for the proposed work. Co-Director Townsend is a leader in spectral biology and remote sensing, specializing in spectroscopic methods and their application to ecology. His team has engineered technological and bioinformatic approaches to using spectral data to predict plant function across spatial scales (Cotrozzi et al. 2017, Madritch et al. 2014, and he is leading a collaborative effort to integrate NEON data. Co-Director Reich is an international leader in functional plant biology and modeling ecological processes within and across biological scales from leaf to the global biosphere , Reich 2014, Huntingford et al. 2017, Pellegrini et al. 2018. He has established several of the world's leading and longest running global change experiments with future carbon dioxide, temperature and/or rainfall levels (Reich et al. 2018b, Thakur et al. 2018, Thakur et al. 2019) and led scaling approaches that model leaf to global scale plant function (Madani et al. 2018, Moreno-Martínez et al. 2018). Our overall Team comprises (1) Project Leaders (faculty and postdocs, see List of Participants) who will oversee each of the five Research Themes, projects within those Themes, and education and training activities (Fig. 3); and (2) staff who will collaborate with Project Leaders on outreach activities. In addition, US and foreign collaborators will participate in various ways including participating in and leading training exchanges. Reich, Hobbie, Cavender-Bares, and Isbell have worked together on biodiversity and global change research for many years at the Cedar Creek Ecosystem Science Reserve NSF LTER site (CCESR), including using spectroscopy to detect ecological processes, while Townsend, Desai and Pavlick are developing the capacity to use NEON's imaging spectroscopy to map foliar functional traits at all NEON sites . Through our prior collaborative grants, the UMN and UW teams have developed a strong ongoing collaboration that forms a formidable intellectual foundation and a positive social community for integrative biology in the Upper Midwest. Despite our history of collaborations, our synergistic team has never before put the full set of its complementary expertise together to address the causes and consequences of changes in biodiversity across scales.
The External Scientific Advisory Board (ESAB), composed of eminent scientists, will meet annually and provide strategic guidance, vision, and accountability for the Institute (Fig. 3). They will assist the Directors by evaluating research themes/projects and advising on scientific matters. Board responsibilities will include advising on Institute priorities, reviewing and evaluating Institute projects, and helping define and evaluate Institute performance metrics. This board will not have formal authority to govern.

Risk Management Strategies
The Directors and Project Leaders have a strong history of collaboration, inclusive authorship, and promoting early career scientists to take leadership on publications. If a scientific disagreement arises, we will attempt to resolve it through collegial discussion or seeking guidance from colleagues. In the unlikely event mediation is needed to resolve a dispute, Dr. David Greenstein, Associate Dean for Research in the College of Biological Sciences at UMN will provide mediation. If a project ends (see Feedback and Evaluation), the Directors will jointly determine how best to reallocate resources to optimize meeting the overall Institute goals. If any dispute arises that cannot be resolved by mutual agreement after meetings and mediation, it will be finally settled under the Rules of Conciliation and Arbitration of the American Arbitration Association by one arbitrator appointed in accordance with the Rules.

Feedback and Evaluation
The Institute will monitor projects through a two-pronged approach. First, projects will be monitored throughout the year by the Managing Directors for the projects' progress towards completing milestones and meeting objectives as delineated in project work plans. The Directors and Managing Directors will have regular (minimum quarterly) check-ins with Project Leaders to assess progress. When milestones are not being met, the Directors and Managing Directors will determine the adjustments required to get the project back on track. In some instances, project delays may be due to conditions beyond the Project Leader's control. In these circumstances, the delay will be documented and necessary adjustments made. If the delay were to result in an inability to complete the project, a meeting would be held among the Institute Directors and the Project Leader to determine if the project should end. Second, a formal annual assessment process will be led by the Managing Directors with the ESAB and experts recruited specifically for individual projects serving as a Research, Education and Outreach Evaluation and Advisory Panel (REAP), coincident with an annual symposium (see below). An approach similar to the mid-term reviews of NSF LTER projects will be adopted. Each panelist will review all projects in their assigned area. The review process will include a two-step process: 1) REAP members will review and score projects independently and 2) the Institute will convene REAP members via conference calls to discuss and provide final recommendations for project advancement and adjustments. REAP recommendations will be reviewed by Institute Directors. Evaluation criteria will explicitly include biology integration advancement, integrative training advancement and transdisciplinary collaboration. In addition, the research projects will be reviewed and scored for milestone completion and deliverables achieved (or on target). Prior to the annual REAP assessment, Education and training efforts will be assessed by a faculty team at UMN and UW, led by Willis as part of the highly acclaimed Biology Teaching and Learning Department at UMN. The process will include an assessment of pre-defined learning outcomes, learning success indicators, and students' sense of inclusivity and confidence pre-and post-participation. Post-doctoral evaluation is addressed in the Post-doctoral Management Plan. Outreach evaluation processes for the Bell Museum, Cedar Creek and Market Science activities will be led by Cavender-Bares in collaboration with Menninger and Potter and evaluation of Minute Earth activities will be led by Reich in collaboration with Neptune Studios; both have wellestablished evaluation procedures in place. Trowbridge will lead evaluation processes for engagement and training activities with the Wisconsin Tribal Conservation Association. These evaluation procedures will be reviewed by the REAP and the ESAB. Education/ training and outreach efforts will be adjusted based on review and assessment.

Evaluation Results
For multi-year projects, reviewers will use the annual review scores and assess the project goals, deliverables, methods, and feasibility for the upcoming year. If the project scores high and reviewers deem the project plan feasible, the project will continue. The evaluation plan will serve to assess the overall success of the Institute and inform and adjust Institute priorities and investments. It is intended to foster program transparency, connect multiple evaluation activities, and provide a comprehensive document that articulates the plan components.

Annual Symposium
Every winter, the Managing Directors and Project Leaders will organize a symposium to be held at UMN or UW in alternating years in which all postdocs and students will present research results. The ESAB will attend these events in person, if possible, or via video when necessary. The Managing Directors and Project Leaders will prepare an annual report documenting results within each project Theme and include abstracts of each subproject. These will be presented orally or with posters at the symposium. One day of the symposium will be open to the public, and an ESAB member may be invited to speak.
The ESAB will provide an annual evaluation of the Institute's progress and offer feedback for improvement. ESAB members were chosen based on their experience in managing projects of similar scope and scale.

Conceptual Background and Current Evidence
The problem. Biology has increasingly fragmented into subdisciplines, posing challenges to uncovering unifying principles and integrating understanding across biological scales. This lack of integration is particularly problematic for understanding evolutionary sources of biological diversity, its local and global responses to global change threats, and the consequences of these changes for ecosystem functions and biosphere dynamics. Such understanding is critical for managing threats to life support systems. We propose to advance integration of biology by examining plant biological variation in response to environmental variation across a range of scales, using a common conceptual framework and set of integrative tools. Specifically, spectral observations at each scale will serve as information-rich inputs to predictive models developed from theoretical understanding of biological processes to yield unifying principles that are critical to advancing knowledge of biodiversity change and its impacts. We view biodiversity as multi-faceted, encompassing biological heterogeneity at levels from genes to whole plants, species, communities, ecosystems, landscapes and the globe. We investigate the processes by which life scales across hierarchies, space, and time and ask how biodiversity at one hierarchical level interacts with environmental variation to influence emergent properties at the next higher scale (Fig. 1). We hypothesize that components of biodiversity at one level drive processes that lead to emergent properties at higher scales and test the extent to which spectral biology offers a unified lens for detecting such properties. Integrating diverse theoretical frameworks and multi-scale measurements, we aim to advance modeling to predict responses of vegetation composition and biogeochemistry to global environmental change at various hierarchical, spatial, and temporal scales. To address this aim, we propose linked synergistic projects that fall within and across five core Themes (Table 1, Fig. 1, Fig. 4).
The spectral approach. We use 'spectral biology' to detect diversity at each biological scale. Spectral information will be coupled with genetic, physiological, community, and ecosystem data and integrated using process-based models to predict processes and patterns across scales. 'Spectral biology' harnesses information from plant electromagnetic spectra-the patterns of light absorbed, transmitted, and reflected at different wavelengths -that are aggregate indicators of the chemistry, morphology, and physiology of leaves, canopies, communities, and ecosystems (Fig. 5). For example, leaf and canopy chemistry and morphology influence their 'spectra', but recorded within these spectra are the products of evolution and ecology , Meireles et al. 2020a). The five project Themes-1. Genetic and environmental drivers of trait variation, 2. Evolution of variation in traits across the tree of life, 3. Community assembly dynamics, 4. Biodiversity consequences for ecosystems, 5. Advancing tissue-to ecosystem-scale modeling of Earth's land surface-and the bridging projects (BP) that integrate them, indicated by connecting arrows.

Figure 5.
Electromagnetic spectra of plants and ecosystem components with an indication of the underlying biochemistry from . Specifically, plant pigments, macronutrients, leaf water content and water potential, lignin, cellulose, nonstructural carbohydrates, leaf mass per area, and polyphenols can all be detected spectrally (Homolova et al. 2013, Jetz et al. 2016. Spectral biology can be applied at the leaf (Jacquemoud and Ustin 2019) or canopy level using proximal, airborne, or spaceborne imaging spectrometers , Foerster et al. 2019). Note that measurements at leaf and canopy scales are comparable but not equivalent, due to canopy structure and variations in sun-sensor-target illumination geometry. The Institute will work with spectral measurements at leaf and canopy scales, but is not focused on scaling spectra between these scales as with radiative transfer models such as PROSAIL ). To harmonize spectral data across projects, processing will employ a common workflow regardless of source (see Research Resources).

Building predictive models for associating spectral signals with functional diversity within and among species and ecosystems.
Spectral biology is powerful because it provides concurrent insights into multiple factors that shape plant and ecosystem function. The same spectra that can predict chemical and morphological features in plant leaves and canopies (Asner and Martin 2016, Asner et al. 2017) have evolved across the tree of life (Meireles et al. 2020b), with some wavelengths and traits predicted by phylogenetic relationships (e.g., chemistries that respond to pests and pathogens) and others relatively constrained among all lineages (e.g., those related to photosynthetic pigments). We have barely begun to scratch the surface of the interacting evolutionary, genetic, and physiological processes that underlie spectral patterns. Our Institute will fill knowledge gaps in what drives spectral, and thus functional, variation within and among species, and then exploit insights to inform models of how individual taxa, communities, and ecosystems respond to global change.

Scaling from leaf-level functions and community interactions to ecosystem
properties. An extensive literature on "biodiversity and ecosystem function" (BEF) , Liang et al. 2016, Hooper et al. 2005, Cardinale et al. 2012, Loreau and Hector 2001 indicates that diversity has a net positive impact on a range of ecosystem services and that current losses in biodiversity may have wide-ranging negative consequences for humans (Pecl et al. 2017). Diversity-in its many forms-enhances the ability of ecosystems to efficiently capture resources and bestows 'stability' on ecosystems over time (de Mazancourt et al. 2013). Genetic, taxonomic, phylogenetic and functional diversity in plant communities are all critical components of diversity and inextricably linked to soil biota and other trophic levels (Bardgett and van der Putten 2014). Trait diversity, plantplant interactions, and plant-soil feedbacks represent dimensions of biodiversity that can drive ecosystem processes (Reich 2012), including responses to global change (Reich et al. 2014, Reich et al. 2018b). Spectra can be used to detect neighborhood interactions and reveal ecosystem properties-such as enhanced productivity due to interactions among multiple species-that emerge from diversity effects   (Fig. 6).

Linking biodiversity-ecosystem function relationships to global biosphere dynamics through Earth system models (ESMs).
ESMs, currently our best means of predicting biosphere responses to global change, dramatically oversimplify the functional diversity of vegetation by lumping all vegetation into a small number (≈ 4-14) of plant functional types (PFTs). Moreover, each PFT (e.g., temperate broad-leaved deciduous or tropical evergreen rainforest trees) is prescribed a single value (the mean of available measurements) for critical traits. Consequently, most models treat enormous areas of biodiverse vegetation as functionally and chemically identical. In reality, there is considerable heterogeneity of traits within and among ecosystems across Earth . Given that biological processes, including those that drive carbon cycling, are non-linear with respect to trait values, and depend on species interactions, mean traits are insufficient to accurately describe ecosystem behavior. Our Institute will use a combination of spectral data, trait data, machine learning, and modeling to scale trait functional diversity to the global scale.

Core Themes and Synergistic Projects
We outline a set of synergistic collaborative projects within and across five Themes (Figs 1, 4) and then describe a set of bridging projects (see Fig. 4) which will integrate among the Themes. (Fig. 7).

Theme 1. The genetic and environmental drivers of trait variation at the leaf and whole plant scale linked by transcriptomic, metabolic, and morphological variation
Spectroscopy has the potential to document genetic variation , Yendrek et al. 2017) as well as responses to abiotic or biotic stresses (Mo et al. 2015, Moghimi et al. 2018, Sytar et al. 2017, Tirado et al. 2020), but the underlying mechanisms remain poorly understood. Genetic variation and transcriptomic responses to environmental factors affect observable phenotypes (which can be measured as spectral properties) via biochemical mechanisms that affect the metabolome. We will identify spectral features associated with variation in these molecules to gain insights into the Remote spectroscopic imaging of a tree diversity experiment allows us to assess diversity effects on spectral reflectance, functionally important canopy traits (e.g. %N) and ecosystem functions (e.g. productivity) as well as underlying ecological processes. By simulating the spectra of mixed-species stands from monocultures, we can calculate the spectral net biodiversity effect (following a field-based approach) and partition novel, ecologically informative contributions to this net effect, such as spectral dominance and spectral plasticity effects. In this hypothetical example, two species (circles, triangles) are depicted in monoculture, simulated mixtures, and observed mixture. Each assemblage is illustrated in cross-section, from overhead (grid represents pixels), and as a mean reflectance spectrum. underlying physiological and metabolic states of living tissues. We will use 'omic' technologies to generate high resolution data on variation in transcript and metabolite abundance among genetically different individuals to understand within-species spectral variation (Madritch et al. 2014, Vergara-Diaz et al. 2020, which can then inform understanding of the sources of within-species variation in responses to global change. To do so, we will leverage genetic and experimental resources suitable for transcriptome and metabolome analyses, exploring responses to global change factors for maize and oak (1.1); assessing genotype x environment (GxE) interactions in responses to climatic and biotic pressures in tree (oak, beech, two pines) and crop (potato, corn) common gardens (1.2); and assessing mature tree responses to drought and pest/pathogen exposure in an existing experiment (1.3). Bioinformatic approaches will be used to uncover mechanisms driving correlations between genetic, metabolic, physiological, climatic, and spectral data.

Abiotic global change experiments in controlled environments.
In growth chambers, we will assess GxE interactions on spectral traits by growing seedlings of at least ten different genetic backgrounds (i.e., genetically identical individuals for crops, or siblings within a seed family for trees). Seedlings will be exposed to different environmental stresses including control, cold, heat, and salinity. Also, replicated plantings of corn and oak from each of 10 genetic backgrounds will be placed within the rings of the BioCON experiment at the Cedar Creek LTER to impose the same CO treatments as in Theme 3.

Common gardens across climatic and biotic gradients.
We will explore traits connecting genetic variation and GxE interactions to spectral variation by leveraging four existing sets of studies: (1) Pinus ponderosa genetic trials (200 half sib families) in five sites across MT and ID and a seed orchard in WA with 28 genotypes; (2) Georeferenced beech (Fagus sylvatica)-dominated forests, extant biodiversity plantations that include F. sylvatica, a common garden experiment with 100 seed families in Zurich in which climatic factors (water, soil, herbivory) can be manipulated, and 5 common gardens with 20 seed families across a Swiss climatic gradient (these experiments, campaigns, and sample/data analysis are fully supported by the UZH and URPP-GCB); (3) The Adaptation to Climate reciprocal transplant Experiment (ACE) that has 60 seed families of bur oak (Quercus macrocarpa) growing in a nursery to be planted in common garden sites across a climatic 2 Figure 7.
We will examine the mechanistic basis for spectral and functional variation within a species, scaling from the variation in genes (left), transcriptomes, and metabolites to the variation that emerges in leaves, canopies, and genetically distinct individuals exposed to contrasting environments (right). gradient in MN, IL, and OK; and (4) 9 potato trials established in 9 states with 10 genetic backgrounds (USDA-NIFA funded). We will identify plant responses (physiology, yield, quality) to soil microbiome diversity, pathogen inoculum, and pathogen suppression, leveraging data on microbial community composition via sequencing of prokaryotic 16S-V4 and fungal ITS2 rRNA gene regions. For each of these sites we will leverage micrometereological data and collect leaf and whole canopy spectra and species-specific phenotypic data to associate with metabolomic and transcriptomic profiles.

Field manipulation experiment.
We will further leverage a drought experiment at the Sevilleta LTER near Socorro, NM focused on pine (Pinus edulis) that uses rainout shelters to divert 45, 75, and 90% of ambient precipitation away from trees. We will access micrometeorology, soil moisture, and sap flow data, as well as PhenoCam photos, chemical defenses, non-structural carbohydrates, physiological measurements, and insect community composition. Leaf and whole canopy spectral traits will be collected immediately before tissue samples are collected for transcriptome and metabolomic profiling.
Methods for linking transcriptomics, metabolomics, and spectral reflectance. For all Theme 1 projects, we will generate RNAseq based transcriptomes and metabolomic data using established protocols. For each RNAseq sample, we will generate ~20M RNAseq read pairs on an Illumina platform, to determine transcript abundance for each gene. For metabolomic profiling, pigments, natural products, and small signaling molecules (e.g., defense compounds, hormones) will be analyzed using liquid chromatography (HPLC-UV) at UMN and mass spectrometry-based profiling (UHPLC-MS/MS profiling) (Sedio et al. 2018) at UZH by collaborator Schuman and UW. Spectral, transcriptome, and metabolite traits for each species will be analyzed using ANOVA to estimate G, E, and GxE effects. In both the CO and drought field experiments, we will fly the UAS with HySpex full-range imaging spectrometer 3x per growing season. The experiments in Theme 1 will allow us to examine how the abiotic environment or soil microbiome affects plant spectra, and by extension, chemistry, physiology and/or disease expression. We will identify wavebands that are strongly influenced by the relevant environmental factors in each species and assess the relative impacts of genotype and environment. Analyses of transcript levels within, and among, environmental conditions relative to spectral traits will identify associations between spectral reflectance and gene expression. Genes that exhibit association with spectral traits will be used to perform gene ontology or pathway analyses to identify metabolic pathways or physiological processes that are associated with variation in spectral traits Vergara-Diaz et al. 2020). Metabolomic profiles will similarly be analyzed within and among environmental conditions in relation to spectral traits. Results of these two analyses will be used to identify overlap in transcriptomic and metabolite changes which may explain variation in spectral traits.
Theme 2. How evolution generates the functional and spectral variation across the tree of life and its utility for biodiversity detection (Fig. 8).
Functional and spectral variation among species and lineages is critical for remotely detecting biodiversity in plant communities. Although such variation arises from macroevolutionary processes that involve convergence, divergence, and constraints in plant structure and chemistry, the effects of evolution on plant spectra are poorly understood. By coupling macroevolutionary models and physical models of leaf spectral properties (Fig. 9A), we can assess how evolution affects plant spectra (Meireles et al. 2020b), including selection following habitat shifts, rapid diversification, and evolutionary constraints in traits (Fig. 9B). We will make novel inferences about the evolution of plant function and spectra using two complementary approaches: (1) macroevolutionary modeling across the seed plant tree of life using compiled spectral and trait datasets (2.1.1; breadth-focused, sparse data) and (2) analyzing the mechanistic links among genetic, metabolic, and spectral variation and the macroevolutionary process by comprehensively sampling two model clades (sensu Knapp et al. 2004, Buell 2009, Cavender-Bares 2019), oaks (Quercus) and pines (Pinus) (2.1.2, depth-focused, dense data).

Sampling across the tree of life.
We will examine the origins of diversity in functional and spectral traits observed across the plant tree of life using multivariate Ornstein-Uhlenbeck models to estimate evolutionary rates, constraints (or directional The genetic, structural, and biochemical bases for spectral variation within species (left) help illuminate the mechanistic basis for variation among species and the evolutionary processes that have shaped functional variation across the tree of life (right).

Figure 9.
Integration of trait evolution and leaf spectral models enables estimation of evolutionary parameters from spectra and simulation of leaf spectra along a phylogeny. Ancestral leaf attributes evolve along a phylogeny, generating the current leaf attributes that underlie spectra. From the evolved leaf attributes, radiative transfer models estimate spectra that carry the signature of the phylogeny (A). This framework allows us to infer the evolutionary dynamics of spectra and to predict spectral responses to evolutionary pressures (B) as well as to compute the probability that an unknown spectrum falls anywhere on the tree of life (C). evolution), lability, and coordination in plant traits (Uyeda andHarmon 2014, Clavel et al. 2015). Existing leaf-level spectral datasets (Meireles et al. 2020b) (see research resources) will be expanded through collections from: (1) collaborator Laliberté at the Canadian Airborne Biodiversity Observatory (CABO) (grasslands and forests in temperate and boreal regions), (2) collaborator Coomes (>280 tropical tree species in Borneo), (3) Project Leader Meireles (300 Neotropical species in the Brazilian Atlantic Rainforest). We will also assemble spectral profiles for tree canopies in multiple arboreta for multiple individuals of ~300 species. Trait data will be measured, obtained from the TRY database (Kattge et al. 2020) and/or predicted from spectra using the PROSPECT model (Féret et al. 2017). Finally, we will obtain the phylogenies for study species by subsetting existing plant mega-phylogenies Brown 2018, Li et al. 2019).

Comprehensive sampling within two model clades.
Using comprehensive species sampling in two broadly distributed lineages, we will explore the mechanistic links between genetic, metabolic, and spectral variation and long-term evolution. We will measure leaf-level spectra from living collections for ca. 300 (of 450 known) Quercus species and ca. 60 (of 120 known) Pinus species from arboreta in the US, France, England, and Mexico. Mature leaves will be collected and flash-frozen following established protocols for RNA-seq Yang et al. 2017, Sedio et al. 2018 and metabolite analyses , Sedio et al. 2018; see Theme 1). Spectral data will be accompanied by species distribution and environmental data layers (Waltari et al. 2014, Fick and Hijmans 2017, Pinto-Ledezma and Cavender-Bares 2020, Randin et al. 2020) to characterize species' environmental niches. We will perform phylotranscriptomic analysis to enhance currently available phylogenies (Saladin et al. 2017 and to reconstruct the history of gene family evolutionYang et al. 2017. Macroevolutionary analysis of those data (following methods in 2.1.1 and Sedio 2017) will allow us to estimate whether the genetic, metabolic, and spectral signals detected in shortterm stress response have long-term macroevolutionary signatures in habitat specialization, molecular evolution of genes, metabolite evolution, and spectral profiles.

Biodiversity detection and modeling.
We will spectrally identify species, lineages, and community types through remote sensing across scale (Lausch et al. 2016, Malenovský et al. 2019 in two complementary ways. First, we will expand on a prior NSF-NASA project to develop spectral libraries of remotely sensed tree canopies in temperate forests and arboreta, using statistical and machine learning methods to identify species from airborne data, including from NEON. For species with no prior spectral data, we will place them into lineages based on spectral evolution models (project 2.1, Fig. 9). Second, we will build novel models that jointly predict species distributions and community composition (Randin et al. 2020). These models will incorporate abiotic factors (Lausch et al. 2019) and remotely sensed dynamic vegetation components (leaf area index, traits (Czyż et al. 2020), and phenology (Garonna et al. 2018) to predict species distributions (Hobi et al. 2017, Radeloff et al. 2019, Pinto-Ledezma and Cavender-Bares 2020. Models (Thonicke et al. 2019) will be compared with direct detection methods to develop an approach for biodiversity detection and changes in community composition.
Theme 3. How functionally and spectrally distinct taxa, sampled across the tree of life, interact locally, leading to the assembly and dynamics of communities at multiple spatial scales, under current and future environmental conditions (Fig. 10).
The composition and diversity of ecological communities emerge from species interactions within and among local communities. To address Theme 3, we will leverage (1) long-term forest and grassland manipulations of global change factors and (2) a new dispersal experiment in grasslands. These experiments allow the assessment of how trait variation, individual species performance, and plant-plant interactions drive community assembly processes and associated diversity. As platforms for this research, we will use ongoing long-term open-air field experiments in Minnesota (9 to 23 years-old) in which CO , temperature, rainfall, N deposition, and/or plant biodiversity are manipulated [B4WarmED, IDENT (IDENT-Cloquet, FAB-1 and FAB-2), BioCON, TeRaCON] (Reich et al. 2018a, Reich et al. 2018b). We will also use a new experiment, RESCUE, that manipulates dispersal to test meta-community theory predictions.

Grassland global change experiment.
In the nested multi-factor (temperature, rainfall, CO , N and diversity) BioCON/TeRaCON experiments, we will test (1) whether outcomes of species interactions in mixtures under global change can be predicted from traits of individual species observed in monocultures, and (2) how global changes alter local coexistence, favoring some species over others. For (1) we will assess whether species that drive resources (nutrients, light, water) to the lowest levels in monoculture outcompete other species in mixtures, as predicted by resource competition theory (Tilman 1982), and for (2) whether global changes disrupt coexistence and shift competitive outcomes by altering limitations to plant growth and survival, as predicted by multiple resource limitation and niche theory (Holt et al. 1994, Chase andLeibold 2003) and observed at this site. Multiple interacting global changes can lead to complex outcomes over decades; for instance, elevated CO initially promoted C3 grasses, as expected, but later promoted C4 grasses (Reich et al. 2018a). In ≈100 plots, we will repeatedly map plants in 10x10 cm grids (324/plot) and align these measurements with hyperspectral data collected via drone. Preliminary data suggest that even closely related species are identifiable by spectral signature. We will use these data to determine species-species 2 2 2 Figure 10.
Evolutionary processes involved in generating the variation within and among species across the tree of life (left) provide the raw materials-taxa and their functional traits-for the processes (growth, survival, dispersal) by which individuals interact, leading to coexistence or to competitive exclusion and dominance, in local communities and metacommunities (right). spatial co-variances under contrasting global change treatments, explore underlying mechanisms (e.g., competition, facilitation, plant-soil feedbacks, niche complementarity), and examine how composition, diversity and spatial structure influence community-scale responses (such as diversity).

Metacommunity grassland experiment.
In RESCUE we will manipulate species dispersal, a metacommunity process (Loreau et al. 2003, Leibold et al. 2004, Leibold et al. 2017) that partly determines plant diversity at local to regional scales. We will test whether (1) habitat loss treatments decrease plant diversity by inducing dispersal limitation, and (2) seed addition treatments alleviate dispersal limitation and maintain high levels of plant diversity. Dispersal determines the spatial scaling of biodiversity by creating spatial betadiversity (i.e., turnover in species composition across communities), which determines the extent to which gamma diversity exceeds local-scale alpha diversity. Plant community mapping and remote sensing will be conducted as in BioCON.

Forest diversity and global change experiments.
In the B4WarmED , Reich et al. 2018b, Thakur et al. 2018 and IDENT experiments (Grossman et al. 2017), we will assess how the traits of focal species and the phylogenetic, functional, and spectral composition of neighboring trees influence the survival, growth, and reproduction of each species and its response to competition and increased temperature, drought, and diversity loss. We will test whether traits and plant-plant interactions explain competitive outcomes, evaluating how differential responses to global change influence community composition. Moreover, using 11+ years of survival and growth data, we will study how trait values affect species' performances within neighborhoods and, in turn, predict shifts in community composition and structure through time under global change factors. (B4WarmED and IDENT). In a subset of Theme 3 studies, we will use leaf-scale daily and seasonal physiological relations , Reich et al. 2018b, Stefanski et al. 2020 and spectral profiles to model total annual carbon balance per leaf and plant in response to temperature and rainfall change (B4WarmED) and to diversity of neighbors (IDENT). This will illuminate mechanisms that drive competitive outcomes, which drive community assembly processes. We will assess whether whole-plant carbon balance estimates (benchmarked against growth data) explain competitive outcomes and community composition.

Neighborhood carbon cycle
Theme 4. The consequences of biodiversity for ecosystem functioning and its response to global change (Fig. 11).

Community Scale Traits and Biogeochemical Cycles.
In all seven global change experiments (Theme 3), we will (1) test whether the mean and variance of communityscale traits explain system-scale C and N cycling and its responses to global change factors (Reich 2012, Reich and Hobbie 2013, Ellsworth et al. 2017), (2) use models to simulate NPP from community traits, benchmarked against biomass measures (Ali et al. 2013, Ali et al. 2015 and identify the mechanisms by which functional diversity drives ecosystem function, and (3) assess whether spectral signatures of individuals, neighborhoods, and communities enable us to detect and model such responses to global change factors . For example, spectral profiles of plant canopies (Theme 2) will be used to identify functional traits, including canopy nitrogen, and to predict outcomes of plant-plant interactions in terms of ecosystem productivity ). Our previous work shows that remote spectral detection of the traits and taxonomic identities of plant canopies can accurately predict neighborhood diversity and net effects of biodiversity on ecosystems functions , Fallon et al. 2020.
These experiments (1) represent some of the longest-running, most complex, wellreplicated, multi-factor, and ecologically realistic global change experiments of their kind, (2) contain existing taxon-specific trait data (including spectral data) and will be platforms for generating new data of this type during the BII, and (3) provide a rich data infrastructure, as a number of ecosystem properties and processes (e.g., biomass production, soil C flux, soil N cycling, plant and soil C and N pools) have been measured in every plot; allowing benchmarking of process models that use emergent properties (including spectrally derived ones) to predict ecosystem scale responses to global change factors (Reich et al. 2018a, Reich et al. 2018b. Given our extant capacity, we can directly learn much about long-term ecosystem responses to global change from these systems and via our spectral scaling approach, helping to develop methods for assessing response to global change in the unreplicated whole-earth experiment we are unintentionally implementing.

Plant-soil Interactions.
Plant-soil feedbacks are interactions among plants, soil organisms, and abiotic conditions that influence plant performance, diversity, and community structure, ultimately driving ecosystem processes (Delgado-Baquerizo et al. 2020). We will study how interactions between plants and soil biota can lead to complex feedbacks that regulate ecosystem processes. We will use B4WarmED to examine how soil microbial composition and diversity varies with temperature and rainfall manipulation, and whether they can explain C cycling benchmarked with litter decomposition, soil organic matter decomposition, and soil CO flux. We will predict bacterial C-use efficiency (the proportion of C retained in biomass relative to total C uptake; Saifuddin et al. 2019) for the soil metagenome using genome-specific constraint-based metabolic modeling, obtaining 2 Figure 11.
The integration of evolutionary and community assembly processes (left) result in emergent properties of communities (e.g. leaf area index, canopy %N) that drive ecosystem processes (right) and their responses to global change. To address Theme 4, we will leverage the same set of long-term field experiments described above (Theme 3), complementing existing data with additional modeling and measurements.
draft GEnome-scale Metabolic reconstructions (GEMs) from metagenomes using the DOE KBase automated platform. The GEMs will be quality checked by the test suite 'memote' (Lieven et al. 2018) and metabolic interactions within the soil microbiome will be simulated under different conditions (Orth et al. 2010, Chan et al. 2017a). We will test whether microbiomes in ambient and treated plots can be characterized by individual or emergent traits of the microbiome. We will link metagenome-based predictions to biogeochemical models by coupling the PLFOTRAN model with the Community Land Model (CLM, closely related to E3SM), leveraging a new pipeline that identifies key biochemical reactions. Such coupling will enable next-generation models of soil biogeochemistry that take into account plant-soil-microbiome feedbacks across scales to predict soil C emissions and soil C pool sizes.
Theme 5. How to improve parameterization of tissue-to ecosystem-scale properties at various spatial scales and advance land surface models that incorporate plant functional diversity (Fig. 12).

Functional diversity and ecosystem function in NEON.
Using hyperspectral imagery, we will quantify functional diversity spatially to develop and test BEF relationships, to inform incorporation of biodiversity into global land surface models. First, we will leverage flux tower and hyperspectral imagery from the NEON to identify relationship between functional diversity (FD) and gross primary production (GPP) at broad scales (see Research Resources section). We will calculate FD following Schneider et al. (2017)   ) and relate FD to GPP, a key output of terrestrial biosphere models, estimated from NEON flux towers. We will follow standard approaches for flux tower QA/QC and subsetting trait data to flux measurement area , Wutzler et al. 2018. Previous work has shown that airborne hyperspectral imagery is suitable to predict GPP (Wutzler et al. 2018); here we will link traits and trait diversity to GPP across multiple years and all NEON biomes. We hypothesize that systems with higher LAI, biomass (derived from NEON lidar), and traits associated with growth such as nitrogen will be more productive, and sites with greater functional diversity will exhibit both higher GPP and comparatively greater stability of within-season and between-year GPP, controlling for climate drivers. Building on ecosystem scale estimates of plant functional diversity (left), we will test the effect of diversity on ecosystem functioning across large spatial extents, including by constructing global maps of functional diversity and incorporating processes described in Themes 3 and 4 into global scale models to simulate the influence of diversity on biosphere dynamics (right). (Oehri et al. 2017, Oehri et al. 2020. With collaborators at UZH, we will test the hypothesis that increasing landscape diversity-measured spectrally as the number of land cover types within a landscape-increases landscape functioning and stability (Oehri et al. 2020). Within each NEON ecoclimatic domain we will delineate patches of different types in a specific area that can be described in terms of their shape, form, and pattern (Forman 1995, Heffernan et al. 2014, Oehri et al. 2020. Landscape diversity will be calculated as the intersection of these regular landscapes with the U.S. Land Cover Map  to estimate the number of land cover types within landscapes. Landscape functioning (GPP) will be measured using continuous and high spatial resolution satellite data products (e.g., MODIS EVI, GPP, SIF) to estimate Net Biodiversity Effects (Loreau and Hector 2001) within each regular landscape across the ecoclimatic domains. We will use a multilevel modeling approach (Gelman and Hill 2006) where GPP is a function of landscape diversity and the ecoclimatic domains are random variables.

Diversity and function at the global scale.
At the global scale, incorporation of BEF relationships into models of current vegetation function will require remote sensing for both the characterization of functional diversity and 'ecosystem function' (as opposed to flux towers, as above), and into prognostic models of future vegetation will require modeling (rather than measuring) future trait diversity. For the former, we will address the potential to measure BEF relationships entirely using remotely sensed data, leveraging numerous datasets . For this study, function will be measured using: 1) GPP inferred from multi-sensor (GOME, OCO-2) harmonized spaceborne solar-induced fluorescence (SIF; Li and Xiao 2019, ) and OCO-3 SIF; and 2) estimates of evapotranspiration (ET) derived from EcoSTRESS (Fisher et al. 2020). To characterize functional diversity, we will use airborne hyperspectral imagery coincident with the spaceborne SIF and ET data from the NEON AOP (see above) and NASA's AVIRIS-Classic and AVIRIS-Next Generation (including campaigns over California  . We will test relationships between traits and remotely sensed measures of function (SIF GPP and ET), addressing means, variances and metrics of diversity derived from trait maps.
For simulating diversity effects under global change, both traits and responses must be modeled. Spectral data will be incorporated into terrestrial biosphere models through the generation of global gridded maps of trait diversity and their variation with environmental change, leveraging hyperspectral-derived trait maps and ground-based data. Spectral data will be used in conjunction with a global database of in situ trait measurements (TRY; Kattge et al. 2020), as well as global environmental data, to generate uncertainty quantified global maps of plant traits, building on our prior work ) and utilizing recent advances in deep learning (LeCun et al. 2015), especially deep generative models to quantify uncertainties accurately and flexibly (Hinton et al. 1995, Blei et al. 2017, Kingma and Welling 2019. First, we will use an encoder-decoder network (EDN) trained (Kingma and Welling 2019) simultaneously using quantile losses at multiple quantiles so that the model is accurate not just near the mean but the entire range of the distribution (Koenker 2005, Gonçalves et al. 2016). Second, we will use Gaussian processes (GPs) (Rasmussen and Williams 2005) which we have successfully used in prior global modeling of key plant traits . Unlike traditional approaches, the covariance function will be based on learned deep representations rather than pre-specified parametric covariance functions. These maps will form the foundational dataset to incorporate functional diversity into ESM estimates of land surface carbon uptake. Initial results from FLUXNET flux tower site simulations showed that incorporating one aspect of functional diversity, a range of trait values rather than a PFT mean, can drive changes in total carbon uptake of 10% in the tropics and over 30% in the Arctic (Butler et al. 2021). We will apply this modeling protocol for the NEON flux tower sites as well.
Our site-based simulations provide the basis for incorporating trait distribution maps and novel processes into simulations of the global carbon cycle and will be evaluated against: 1) the ILAMB model evaluation metrics (Collier et al. 2018), 2) empirical diversityproductivity relationship (Liang et al. 2016), and 3) spectrally derived BEF relationships from this study. Last, we will evaluate the influence of diversity on the future C cycle by comparing a baseline climate change simulation using static functional diversity values to a dynamic trait simulation that updates traits according to empirical relationships between traits and environment from this work.
The analyses proposed here will pave the way for much broader analyses that will be possible once global satellite imaging spectroscopy data for mapping functional diversity become available in the mid-to late-2020s from the planned NASA Surface Biology and Geology (SBG) and ESA Copernicus Hyperspectral Imaging Mission (CHIME) missions. This effort represents an unprecedented integration of ground-based data, remote-sensing data, spectral ecology, trait-based ecology, machine learning, and biodiversity-ecosystem function science and expertise, which together will enable large advances in understanding and quantifying biosphere-atmosphere interactions influencing the global carbon cycle.

These analyses also provide an empirical basis for incorporating functional diversity into terrestrial biosphere models, with spectral biology providing the necessary foundation.
Bridging Projects: Integration across Themes. Bridging research (BP1-2, BP1-3, etc., Fig. 4) will integrate across Theme projects. We outline some of these below (space limits prevent full descriptions).

BP1-2, 1-3.
Intensive species-specific studies (Theme 1) will improve understanding of how functions of species have evolved in association with shifts into habitats and ecological niches that vary in precipitation and temperature. These insights will help us develop generalizations about trait evolution (including spectral traits) across the tree of life (Theme 2) and about species' responses-including those that shape interactions among plants and drive community change-to global change factors (Theme 3).

BP2-3, 2-4.
The macro-evolutionary processes involved in generating variation among species (Theme 2) provide the raw materials-taxa and their functional traits-for the processes (establishment, growth, survival, dispersal) by which individuals interact, leading to coexistence or competitive exclusion, in local communities and metacommunities (Theme 3) and to the aggregate community traits that drive ecosystem processes (Theme 4). We will test whether spectral signatures derived from the tree of life project (Theme 2), and extrapolated to model systems (Themes 3 and 4), predict traits that explain species competitive outcomes (Theme 3) and ecosystem function (Theme 4)-and how spectra and competitive outcomes vary with global change context. We will compare multiple approaches for predicting outcomes of neighborhood interactions and resulting ecosystem functions .

BP3-4.
The outcomes of neighborhood interactions and community assembly processes (Theme 3) will result in the functional composition that drives ecosystem outcomes (Theme 4). Here we will address (1) whether the traits that lead to success for species in competition under varying global change contexts predict the magnitude and variation of ecosystem process rates in each context and (2) whether common spectral signatures predict shared responses at species, community, and ecosystem scales.

BP1-5.
Physiological and spectral relations with climate of origin for bur oak and ponderosa pine (Theme 1) will be incorporated into parameterization of their respective clades at appropriate regional scales in land surface modeling (E3SM or CABLE) (Theme 5). Resulting model output (e.g. GPP) will be evaluated against US (FIA) and Canadian (CFS) inventory data, and/or flux tower data (e.g., ).

BP2-5.
Improved quantification of how spectral signatures map onto the tree of life (Theme 2) will be employed as part of new models of global traits developed from machine learning applied to combinations of existing data bases (TRY) and global satellite spectral data, a key foundation of Theme 5.

BP3-5.
Advances in understanding variation in species performance under experimental environmental change, and how variation in spectral signatures and other emergent properties relate to such performance (Theme 3), will be incorporated into process models (G'Day, STANDLEAP, E3SM) that operate at ecosystem to global scales (Theme 5). Model parameters will be updated using both direct (photosynthesis, growth) and spectral scaling relationships (Theme 3). Similarly, the C cycle-environmental response logic (e.g. temperature response of photosynthesis or respiration) required to integrate and scale from leaf to ecosystem will be improved to better incorporate heterogeneity across taxa and environments , Reich et al. 2018b. Model predictions will be evaluated against quantitative benchmarks such as from flux tower data and ILAMB benchmark standards (Delgado-Baquerizo et al. 2020).

BP4-5.
Conceptual and quantitative advances in understanding how biodiversity influences the magnitude and stability of ecosystem processes, such as plant productivity  or soil multi-functionality (Delgado-Baquerizo et al. 2020) (Theme 4), will be used to advance the treatment of biodiversity-ecosystem function processes at large scales (Theme 5), especially in land surface models (Butler et al. 2021). We will incorporate spatial heterogeneity into E3SM's parameterization and logic as well as overyielding mechanisms by which higher diversity results in heightened productivity . We note that cross-cutting projects BP1-5, 2-5, and 3-5 feed directly into model advances that will be employed in BP4-5.

Research Resources
Our institute makes extensive use of existing NSF research investments and resources, including NEON, LTER, as well as resources funded by USDA NIFA, NASA, and the University of Zurich (UZH).

Theme 1. Common Garden Resources with permission to fly drones. Pine common gardens.
Genetic trials of Pinus ponderosa are maintained by Inland Empire Tree Improvement Cooperative (IETIC), with the same 200 half-sibs (5 replicates) across sites that differ in micrometeorology and disturbance regimes (e.g. drought and bark beetles). The Pullman Ponderosa Pine Seed Orchard (PPSO) was established in May 2007; 28 known genotypes were grafted to seedling rootstocks and planted with 10-16 replicates of each. We also leverage a recently established large-scale manipulative drought experiment (NSF-funded) at the Sevilleta LTER, NM that includes Pinus edulis and consists of five 40 x 40 m plots with control, 45%, 75%, and 90% reduction of rainfall, and a 10-yr legacy plot with 45% reduction. Micrometeorology, soil moisture, tree water content, sap flow, and PhenoCam photos (half-hourly) and monthly collections of needle and twig samples for terpene analysis and non-structural carbohydrate (NSC) analysis, water potential, tree canopy health, and Lindgren funnel collections of the arthropod community in each plot; bole xylem, phloem and roots samples for terpene analysis and non-structural carbohydrates and physiological measurements, and bole and branch volatile organic compound emissions are available. Potato common gardens: Field trials of potato varieties have been established in 9 states (CO, ID, ME, MI, MN, ND, OR, WA, and WI), with 12 m x 4 m plots and 6 agronomic treatments each replicated five times, to be sampled annually through 2024. Data available to us includes microbial community composition in every soil sample via sequencing of prokaryotic 16S-V4 and fungal ITS2 rRNA gene regions, as well as crop health, yield, and quality data. The project is funded by USDA-NIFA. Oak common gardens: The Adaptation to Climate (ACE) experiment is in progress with one-year seedlings of sixty seed families of Quercus macrocarpa to be planted in common garden sites across a climatic gradient in MN, IL and OK. Ten replicates of each of the 60 seed families from the three sources will be planted 1 m apart in a randomized block design at CCSER, Morton Arboretum and U OK Kessler Field Station, supported by CCESR NSF-LTER and UMN. Beech common gardens: Fagus sylvatica common garden experiments are currently being established with 100 seed families in one garden at the UZH, and 20 seed families in gardens across a climatic gradient in Switzerland. Maize inbred lines are currently in the Springer lab germplasm collection and are publicly available through the USDA North Central Plant Introduction Station.

Theme 2. Leaf spectra from the tree of life and the oak model clade.
A database of 560 species (and replicates of species) has been assembled through NSF-NIMBioS (Meireles et al. 2020b) and 500 spectral signatures, leaf tissue and functional traits collected on multiple individuals of over 200 oak species in arboreta in France, England, California and in the Midwest by Cavender-Bares through NSF Dimensions of Biodiversity plus large collections from CABO, UZH and Borneo. Finely resolved phylogenetic trees and their climatic envelopes have been generated through the NIMBioS working group (Meireles et al. 2020b) and the NSF Oaks of the Americas project .

Themes 3 and 4. Tree diversity experiments: IDENT -Cloquet located in northern
Minnesota in which species richness and functional diversity have been independently manipulated; 48 different species assemblages (of 1, 2 and 6 species) were planted in replicated plots. The location of all trees (9408) is known and annually measured since planting (11 census points). An IDENT Canopy Carbon Model estimates species and stand-level carbon assimilation at hourly time steps across the experiment using incident radiation, leaf area, photosynthetic light response curves, and foliar phenology to estimate total C assimilation. The IDENT -Forest and Biodiversity (FAB1) experiment at CCESR has a similar design as IDENT-Cloquet. A larger FAB2 experiment with 10x10 m and 20x20 m plots manipulates taxonomic and phylogenetic diversity of trees from 1, 2,4,6 and 12 species mixtures that vary in functional and phylogenetic diversity; these have been supported by NSF-LTER, NSF/NASA, and UMN. B4Warmed located at two sites in northern Minnesota (Cloquet, Ely) is the world's longest running open-air active warming experiment (2020 will be year 12), and one of only two (the other TeRaCON) that control temperature elevations both aboveground and belowground. It is well replicated (72 plots, >10,000 tree seedlings) and includes factorial combinations of temperature, rainfall manipulation, sites and habitat type (open, understory). It has been supported by the US DOE and Minnesota's LCCMR.
BioCON/TeRaCON at CCESR, is the world's longest running free-air CO enrichment experiment, and includes 16 factorial combinations of CO , N supply, and plant biodiversity in a well replicated (371 plots) 22-year old grassland experiment. TeRaCON is a subexperiment using 48 of the BioCON plots, which deploys 16 combinations of warming, rainfall manipulation, CO , and N supply; it is one of only several such multi-factor longterm global change experiments (2020 will be year 9) and has received considerable NSF support through the LTER, LTREB, MRI, Ecosystems and Biocomplexity programs.
The RESCUE experiment (to be established in 2020 as part of NSF CAREER award to Isbell) will fully cross a habitat destruction treatment (loss of 0%, 50%, 95%, or 99% of local habitat area) with a seed addition treatment (no seeds added or sowing a diverse mix of native grassland species) in 64 780 m plots. The experiment will be located in naturallyassembled grasslands adjacent to BioCON experiment, and include some of the same species, allowing us to build on knowledge from these experimental monocultures and mixtures, and test whether their results and predictions can be extended to naturallyassembled grasslands and larger spatial scales.

Theme 5. NEON Biodiversity and Ecosystem Function capability.
Hyperspectral imagery and derived trait maps as well as flux tower data from the NSF NEON network (Townsend NSF 1638720) will be leveraged, including hyperspectral algorithms and distribution maps of 26 traits plus uncertainties for all NEON sites . Additional analyses will make use of NASA AVIRIS hyperspectral imagery and associated trait mapping algorithms , as well as NASA OCO-3 solar-induced fluorescence and ECOSTRESS evapotranspiration data. The UZH provides remotely sensed 3D reconstructed forest site data for 44,000 trees (Laegern site) and access to the Airborne Research Facility for the Earth System (ARES) to measure terrestrial processes -using spectroscopic imaging, LiDAR and other sensor types-at regional scale, and all data available under the open science data policy as agreed upon in the NASA Space Act Agreement between UZH and NASA JPL.

Energy Exascale Earth System Model (E3SM) and Global Trait Diversity Maps.
The land model of E3SM has been run at UMN. E3SM features subroutines to estimate the influence of nutrients and biogeochemistry on the carbon cycle. Global maps at 0.5º resolution for specific leaf area, leaf nitrogen and leaf phosphorus are available ) Each pixel has a log-normal distribution of trait values described fully by a unique mean and variance. This data product is ready to be incorporated into E3SM to conduct simulations of functional diversity and productivity. Work on E3SM was supported as part of the E3SM project, funded by the US Department of Energy.

All Themes:
Spectral Data Repositories. The NASA-supported EcoSIS.org database for spectral measurements (not images) and ancillary measurements is maintained and curated at UW, with DOIs supplied. The UZH Specchio database includes all data made available within the agreement of the UZH open access policy.

Hyperspectral Image Processing Pipeline.
Two open-source data repositories for processing hyperspectral imagery HyTools (citation) provides workflow and code to generate functional trait maps and uncertainties from atmospherically corrected hyperspectral data sources (e.g., NEON, AVIRIS, HySpex), including BRDF and topographic correction. The open-source HyPro workflow enables processing for UAS and airplane HySpex imagery, which performs orthocorrection, atmospheric correction, and smile/keystone correction to instrument radiance data. Models and related code are stored on the NASA-supported repository EcoSML.org, with DOIs. We also have access to the Laboratory Goniometer System (LAGOS) and calibration facilities of the Remote Sensing Laboratories, UZH.

BROADER IMPACTS Education and Training
Our Institute will equip the next generation of integrative biologists with the interdisciplinary skills, capacity, perspectives, and passion required for integrative research that enhances their traditional "disciplinary" training. The Institute will provide numerous integrative training opportunities to postdoctoral associates, graduate students and undergraduates at UMN, UW, and U Maine. Postdocs will receive training at UMN, UW and U Maine; graduate students will receive training at UMN and UW; and undergraduates will receive training at UMN, UW, and U Maine. Each institution will also involve undergraduates through internal programs, as well as REUs or other NSF programs to broaden participation of diverse undergraduates. The Institute's unifying theme and overall glue will be dual foci in spectral biology and process-based modeling, with specializations in fields including micro-and macro-evolution, plant physiological ecology, genetics, plant biochemistry and metabolomics, community ecology, ecosystem ecology, phytopathology, plant-insect and plant-microbe interactions, and global ecology. We will use a variety of training opportunities and mentoring activities, which include the following:

Training Initiatives
I. Inclusive training of diverse graduate students and postdocs. The Institute will invest significantly in training of graduate students and postdocs. The UMN, UW, and Maine teams have made extensive efforts within their graduate recruiting programs to attract underrepresented students, and the Directors and Project Leaders have strong track records of recruiting diverse graduate students and postdocs including from Hispanics, Asian Americans, international, first-generation college, and LGTBQ communities. In addition, we have assembled a diverse team of Project Leaders, including women, early career researchers, and underrepresented minorities.

II. Undergraduate research opportunities.
Undergraduates will play major roles in Institute research. All investigators have strong track records training undergraduates, including recruiting and training Native American students and first generation college students, as demonstrated through prior training in the Dimensions of Biodiversity Broadening Participation program, the LTER-NASA NICE-T partnership, and the UMN and UW Undergraduate Research Experience for Undergraduates (UROP) programs.

III. Wisconsin Tribal Conservation Advisory Council (WTCAC).
Our Institute is committed to fostering an inclusive research environment and training the next generation of integrative biologists to become stewards of our natural resources. Climate and land use-change pose significant threats to forest systems via stress-induced susceptibility to native and invasive pests and pathogens. The eleven Native American Tribes in Wisconsin maintain independent forestry agencies that vary in their sustainable management approaches despite similar climate-related challenges. Therefore, Tribal forests provide a unique opportunity to determine best practices to address ecological issues while contributing to the overarching aims of our Institute. The techniques and data collected from Tribal land will, in turn, provide much-needed information for Tribal forest health specialists suffering from resource and personnel limitations. To foster collaboration between UW and Tribal forestry agents, Project Leader Trowbridge has been developing an integrative training initiative with Lacey Hill-Kastern, Tribal Pest Survey Specialist with WTCAC. WTCAC is a forum for the eleven Tribes to come together and solve natural resource and conservation issues on Tribal Lands, and Hill-Kastern will foster collaboration between UW and the Tribes. Thus, together with Tribal input, we aim to develop a large project around an issue that is of interest to many-if not all-of the Tribes (e.g. oak wilt, beech leaf disease) and that involves a clade of interest to our Institute. We anticipate that the project would be overseen by WTCAC, pending approval by the Tribes and the WTCAC Board. Graduate and undergraduate students affiliated with the Institute will train with Tribal forest health specialists and learn sustainable management practices currently used to address ecological challenges. In turn, Tribal interns and students recruited through existing WTCAC initiatives (e.g. seasonal workers, pest survey program) will learn cross-disciplinary measurement techniques and data processing with affiliates of our Institute. Through this training, we also hope to recruit Native American interns into UW M.S. programs. An M.S. degree in a forestry or entomology program is necessary to obtain a Tribal forestry agency position, empowering Native American students to make critical decisions with far-reaching effects on their own lands. Bringing students, researchers, and agencies together to address shared environmental concerns will not only enhance severely understaffed Tribal conservation efforts, but will also break down barriers to information sharing between universities and the Tribes, thus enriching both the science and the students' experiences in immeasurable ways.

IV. Annual summer workshop courses on concepts and methods.
3-4 day summer workshops will focus on specific cross-cutting themes of biological integration. In them, scientists trained in one subdiscipline will learn about concepts and methods of others, and all participants will learn how to think, observe, experiment, and model across those fields. Workshops provide a combination of skill development and training including techniques, methods in modeling and bioinformatics, and conceptual background within and across project themes. Project Leaders will be responsible for modules within the workshop, including both conceptual and technical components. Workshops will rotate annually between UMN and UW, hosted at Institute field sites or laboratories. Workshops will be open to Institute undergraduate and graduate students, postdocs, and faculty, as well as international students and postdocs from our foreign collaborators. Additional junior scientists from outside our Institute will be invited to participate to expand the impact of our training and potentially build the Institute research team.

V. Annual winter symposium on research progress.
A ca. 1-day symposium will occur annually, rotating between UMN and UW, for investigators to share plans, progress and findings from their Institute work. PIs have organized similar symposia for a number of prior projects, e.g., at National Center for Ecological Analysis and Synthesis (Cavender-Bares, Isbell), Cedar Creek LTER (Hobbie, Isbell), the German Centre for Integrative Biodiversity Research (Isbell), the National Institute for Mathematical Biology and Synthesis (Cavender-Bares, Townsend, Meireles); and many others.
VI. PhysFest 2021. UMN will host PhysFest at the CCESR in 2021 (Cavender-Bares, Reich and Montgomery in collaboration with J. Nippert, C. Still and B. Helliker). The training workshop, primarily for graduate students, has been funded by NSF-IOS and is in its fourth instantiation. Building on Institute research, we will add a spectral biology component to train students in leaf level and canopy spectroscopy and interpretation, and conceptual background and application.

VII. International and domestic exchange programs.
The Institute will enhance training opportunities for junior and early career scientists, harnessing national and international inperson and virtual training and collaboration opportunities at the University of Zurich in the University Research Priority Program on Global Change and Biodiversity, through the Canadian Airborne Biodiversity Observatory (CABO), and with scientists and engineers at the Jet Propulsion Laboratory. Training at the University of Zurich will focus on metabolomics, spectroscopy, remote sensing, and plant physiological modeling across scales. Training at CABO will focus on bioinformatics, theoretical community ecology, and spectroscopic methods.

VIII. Synthesis Working Groups.
Two Institute synthesis working groups (undergraduates, graduate students, postdocs and other senior personnel) will use two 3-4 day workshops to develop and implement synthesis plans to integrate Institute research and work (and researchers) from outside the Institute.

IX. Special Seminar Courses.
The Institute will conduct cross-institution seminars using videotechnology for both Institute and non-Institute students. These will be led by Institute investigators for both early stage (freshman) and advanced stage (upper-level undergraduate and graduate) students and address issues in biodiversity spanning genes, the tree of life, communities, ecosystems and the global biosphere.

X. Classroom-based Undergraduate Research Experiences (CURE).
At both UMN and UW-Madison we will develop course-based undergraduate research experience (CURE) labs for non-major and major biology courses that provide students the opportunity to work with biodiversity data and participate in ongoing research projects. Similar existing courses have been demonstrated to increase participation of underrepresented groups in research at early career stages (Estrada et al. 2016). Education Project Leader Willis will work with Project Co-Leader Pinto-Ledezma, currently teaching an undergraduate Biodiversity Science course at UMN, as well as faculty at both institutions, to develop programming for a biodiversity CURE.
XI. Student Recruitment. Graduate students will be recruited through organizations such as Advancing Chicanos/Hispanics and Native Americans in Science (SACNAS) and by taking advantage of institution-based fellowships for underrepresented students. Undergraduate students will be recruited from faculty members' courses, university mailing and job lists, and programs such as the North Star STEM Alliance at UMN and Undergraduate Research Scholars program at UW, programs fostering diverse scholarly communities that include students from historically underrepresented groups.
XII. Graduate student mentoring. Graduate students will be well positioned to enter the workforce for biodiversity science with a broad and integrated view afforded by the interdisciplinary training available through the proposed research. Project Leaders will mentor graduate advisees during weekly meetings to develop research questions and thesis goals, provide career guidance, discuss ideas and review progress. Lab meetings will be used to discuss primary literature and give informal presentations. Students will be encouraged to write proposals for internal and external solicitations. Extensive feedback will be provided on proposal development, manuscript preparation, and presentation skills, with opportunities to attend scientific conferences. Students will interact with diverse research groups across their own and other institutions by attending and presenting at the annual winter symposia, and attending summer workshop courses (described above). Also, students will have opportunities to engage in field work, common garden, and greenhouse experiment design and implementation, molecular bench work, plant-insect and plantfungal work, RNAseq, bioinformatics, community ecology, physiological ecology, ecosystem ecology, and modeling within and across disciplines. Funds are dedicated to cross-institution exchange for students to learn new skills and broaden their intellectual base, encouraging a systems/integrative perspective towards understanding biodiversity. Also, graduate students will have the opportunity to mentor undergraduate students, gain teaching experience through teaching assistantships, and participate in outreach (see Outreach). Annual assessments will be performed to develop learning objectives and review progress.

XIII. Undergraduate mentoring.
We have designed our undergraduate training program based on Project Leaders' experience and peer-reviewed best practices (Emery et al. 2019). Project Leaders will work with postdocs and graduate students to design potential undergraduate projects. Beginning students will receive training at their home institution to learn the integrative nature of the work and how their work fits into a larger picture. They will be encouraged to attend lab meetings, Institute meetings, and summer training workshops. Project Leaders will discuss mutual expectations and milestones with students at the beginning of the project. Subsequently, the Project Leader, graduate student, or postdoc mentor will meet weekly with students to discuss research concepts and assess understanding and progress on individual projects. Students will be required to conclude their experience with a presentation to their research group and will be encouraged to present findings and their experience at student research events. Further, students will be encouraged to make intellectual contributions and be involved in manuscript writing. The result will be students that are trained in basic science concepts and methods across a broad range of biodiversity research areas, ready to enter the workforce or compete for graduate research positions.

XIV. Evaluation and Assessment.
The efficacy of these educational and training efforts will be assessed on two main fronts. Lead by Dr. Willis in cooperation with other faculty in the Dept. of Biology Teaching and Learning (BTL) at UMN, we will assess the success of these programs (across institutions) based on pre-defined learning outcomes and learning success indicators. These efforts will focus in particular on the evaluation of the undergraduate and graduate learning programs (mentorships, seminar groups, CUREs). Second, we will assess how these programs promote inclusivity and confidence in undergraduate and graduate students, particularly, students in underrepresented minority groups. Additionally, for graduate students, we will assess how participation in these programs contributes to their career advancement and future participation in interdisciplinary research networks. Both undergraduate and graduate assessments will be performed using several pre-and post-surveys administered to students over the course of their participation in a program. Administration and analysis of both undergraduate and graduate assessments will be led by Willis in cooperation with other BTL faculty and staff.

Outreach
We will partner with the Wisconsin Tribal Conservation Association (via initiatives described above, Activity III), the Bell Museum, Minnesota's state natural history museum (Activity XV), the Cedar Creek Ecosystem Science Reserve (CCESR) (Activity XVI), Neptune Studio's MinuteEarth video channel (Activity XVII) and MarketScience.org (Activity XVIII) to interpret and connect the process and outcomes of our research, and the researchers themselves, with public audiences.
Diversity and inclusion. The Institute will diversify and broaden participation among team leadership, researchers, students, and trainees through a team leadership that is balanced among genders and career stages and is inclusive of researchers from underrepresented groups. Our team has a strong track record of mentoring junior scientists of diverse and underrepresented backgrounds and is committed to prioritizing diversity and inclusiveness in research and training of undergraduates, graduate students and postdocs.

Wisconsin Tribal Conservation Advisory Council.
The activities associated with WTCAC are detailed above under Education and Training (III), but as described above, this effort also includes a considerable outreach component, including presentations and participation in Institute activities and workshops.
Bell Museum. Institute researchers will leverage previous collaborations with the Bell Museum on an exhibit space called "Imagine the Future." The 1,600 square-foot exhibit area focuses on three important lessons from nature, critical for addressing the grandest challenges facing people and the planet: Thriving with Diversity, Living Within Limits, and Adapting to Change. The Thriving with Diversity section, which includes interpretive panels, casework with specimens and tools, and an interactive video kiosk, currently highlights ground-breaking biodiversity research from CCESR, and is set against a largescale mural mosaic of spectral images from the BioDIV experiment. In Year 2 of the proposed project, Project Leaders will collaborate with the Bell Museum exhibits team to refresh Thriving with Diversity content that highlights the project's integrative approaches to studying biodiversity across scales, with a special focus on process and outcomes. Institute researchers will assist with content development and be featured in panel text and associated interactive videos. As they are produced (see below), the three Minute Earth videos will be incorporated into the interactive video kiosk as part of the exhibit. The Bell Museum will assess visitation and experience in the updated gallery as part of the visitor experience exit survey it employs (administered via participation in COVES, Collaboration for Ongoing Visitor Experience Studies). Survey results will be assessed with Institute researchers to inform future modifications.
The Bell Museum hosts a regular public science event series called Spotlight Science, which showcases researchers working in specific areas. These have recently included chronic wasting disease, neuroscience, and plant biology. Researchers across career stages engage visitors through brief presentations and hands-on activities where the primary goal is to create conversation and opportunities for mutual learning between scientists and the public. During each year of the Institute, the Bell Museum will work with the project team to host an Institute-themed Spotlight Science. Prior to each year's event, Amber Kastner, the Bell Museum's public science events manager, will host a workshop for Institute researchers (with a special emphasis on graduate students and early career researchers) on best practices in public engagement and science communication. She will help researchers develop meaningful and engaging hands-on activities to be used during Spotlight Science and subsequently repurposed for other types of public engagement, including CCESR's monthly Lunch with a Scientist series and annual Open House events. We will assess outcomes on both visitors and participating scientists (using surveys and methodologies developed by the NSF AISL-funded EvalFest team), and use the results to inform and modify programs as necessary.
Last year the Bell Museum served over 230,000 visitors from every Minnesota county and nearly every US state. This included nearly 25,000 K-12 students, over half of whom were members of underrepresented minority groups and nearly 75% were economically disadvantaged. The Bell Museum Public Engagement and Science Learning team includes informal science educators, science communicators, and exhibit and program designers who are skilled in interpreting contemporary research for, and determining its impact on, public audiences. They have specific expertise in working with researchers across career levels (from graduate students to senior scientists) and disciplines to develop exhibits, programs, and activities that engage visitors in meaningful ways.

CCESR K-12 activities for diverse learners and teacher development.
CCESR is a biological field station with a legacy of ground-breaking science dating to the 1940s. CCESR hosts more than 14,000 visitors annually for hands-on science experiences. Half of these visitors are K-12 students and their teachers, the majority of whom are from public school districts with significant underrepresented minority populations. In addition, CCESR has number of programs that specifically target underserved students, including a partnership with the American Indian Education program in a local school district, a high school serving recent immigrants, a number of special education programs for young adults, and several charter schools that explicitly serve Muslim students, Hispanic/Latinx students, English Language Learners, and other underserved populations. CCESR's education staff work closely with site scientists to develop programs that bring science to life and build a pathway of literacy and learning for all ages.
We will host a local K-12 teacher at CCESR each summer during the project. The teacher will receive a stipend to work alongside project investigators to assist with fieldwork and data collection. At the end of the summer and into the fall, CCESR education staff will assist the teacher in developing a lesson or activity related to their experience on the project. This lesson will be delivered by the teacher in their classroom and be made available to other teachers via CCESR's website. Additionally, funding from the Institute will support the affiliated teacher to bring up to four classes of students on a field trip to CCESR to build on their in-classroom lesson. We will prioritize recruiting teachers from underrepresented groups serving in high-need school districts who have previous connections with CCESR. A new teacher partner will be selected for each summer. Dr. Caitlin Barale Potter, education and community engagement coordinator at CCESR, will oversee the teachers and coordinate the fieldwork and class visits.

MinuteEarth.
To further the reach of our public engagement outside of the Upper Midwest, MinuteEarth, in cooperation with team researchers, will produce three videos about linked themes from our collective research at UMN, UW-Madison and beyond. Neptune Studio's MinuteEarth is a popular (>300 million views) video science channel on YouTube and other platforms that has produced >190 short videos (2-3 minutes) released in English (but many also in Spanish and Portuguese) versions on YouTube, including many on biology, ecology, and global environmental change, geared to the general public audience. Co-Director Reich is a member of the MinuteEarth production team. The MinuteEarth team is expert in translating complex scientific concepts into language and visuals understandable by nonscientists, yet doing so with great scientific rigor and with an entertaining style that attracts sizeable audiences. Videos on topics such as "the economic value of biodiversity", "the use of AI in ecological research", "honeybee chemical defenses", "orchid evolution and ecology", and "the tree of life" have each been seen by hundreds of thousands, and often millions, of viewers. One video will focus on plants, light and spectra, introducing viewers to the concept of hyperspectral data in close coordination with Co-Director Townsend. The specific topics for the three videos will be developed jointly by Institute and MinuteEarth teams to optimize communication of important science, and to best represent our Institute themes, while also attracting as large an audience as possible.
Market Science. Our Science Team will further engage the public through the Market Science program at UMN, which has been running since 2014. Market Science is a collective of scientists from the University of Minnesota, and around the Twin Cities, sharing science through hands-on learning activities for kids, answering scientific questions for market goers, and creating conversations between researchers and their communities. The program involves scientists of all career stages-from students on up-who aim to connect people with science through public talks and hands-on experiences at venues including farmer's markets and county fairs in Minnesota. We will prepare short, interactive teaching modules to present via Market Science. These modules will explain how human activities are impacting biodiversity and how we can monitor and understand these impacts through NEON and from space.

Data Management Plan
This data management plan follows best practices for data management throughout all stages of the data life-cycle according to practices outlined by DataOne, Data Carpentry and Software Carpentry.
1. Data types to be produced: Spectra from individual leaves/canopies will be stored in CSV or spreadsheet format on locally-managed and backed up to servers at their respective universities before being transferred to EcoSIS (see below), Airborne imaging spectroscopic data collected in support of this project by UW-Madison (airplane or drone) will be in ENVI format and stored on the UW Campus Computing Initiative (CCI) Cluster and distributed initially via secure website; the data collected by UZH (airplane) are stored via the Specchio server (backed up nightly). LiDAR point cloud data (LAS format) will be collected by UAV or terrestrial laser scanner, with the data transferred to locally-managed and backed-up servers at project teams' local universities. Plant growth data and other field or lab plant measurements (e.g., chemistry, physiology, community data) will be collected at numerous field sites by investigators from all participating institutions, and will be stored as CSV or spreadsheet files in a common format, and backed up to a central server daily. High-throughput sequencing data will be collected and analyzed at each site and shared via the Open Semantic Framework (OSF) project instance. High-throughput sequencing data generated by the UMN Genomics Center is stored at the Minnesota Supercomputing Institute on Tier 2 research data for access by all project personnel. Sequencing data generated at the UZH will be maintained as described below under Storage and Backup. Metabolomics data will be stored and backed up locally on the UW-Madison CCI before dissemination. Computer code and software created during the project will be hosted publicly on GitHub, GitLab or Bitbucket. Tutorials and user documentation written for any of the tools developed for this project will be hosted and maintained on the open repositories. Tissue samples and vouchers used in this project will be obtained from existing arboreta collections. If voucher specimens are not already available, voucher specimens will be collected, prepared, and deposited at the UMN Herbarium. Tissues will be destroyed for transcriptomic and metabolomics analysis, with remaining samples stored in -80°C freezers in each PI's Lab. Curriculum materials, including presentation slides, tutorials, code, and test data sets, will be developed for training workshops and seminars. Lessons, notes, and teaching aids will be created for K-12 classrooms. Short videos and panel text for interpretive displays will be developed for museums.
Storage and Backup: Data stored on UMN servers is backed up with snapshots taken 3x daily. 90% of storage is replicated across multiple locations (total 90 snapshots). Data at UZH is stored on the URPP-GCB Dropbox Business account which provides version control, as well as file recovery for 180 days, and additionally backed up to a local server daily; files on the local server are backed up daily to a separate hard disk and additionally archived to two separate magnetic tapes. All UW-Madison data are stored on the Campus Computing Initiative (CCI) servers and backed up to tape nightly. In addition, the Townsend lab has a server on which working data are incrementally backed up hourly.
2. Data standards to be used: Data generated by the project will be retained in scriptreadable formats: CSV or spreadsheet file for numerical data; GeoTIFF, HDF5 or ENVI for imagery; and NetCDF, shapefiles and KMLs for GIS data. Upon initiation of the CCESR, a standard format will be distributed for metadata and metadata entries will be published in a searchable interface on the Institute website upon publication of the associated datasets, including a link to the published dataset, to facilitate data re-use. To assure quality control and reproducibility of the project outputs, two secure cloud-based platforms will allow the distributed project team to organize, annotate, and document the data. Open Science Framework (osf.io) will be used by the project team to collaborate and create shared documentation via the wiki function and the connected shared drives. OSF will ensure good communication between the team members and act as a collaborative electronic lab notebook for disparate data types and sites. Fulcrum (www.fulcrumapp.com) will be used by the project team to record site information (including metadata) for all leaf spectral data, enabling interoperability among all participants through consistent cataloguing and descriptions for all participants. Fulcrum data are instantaneously backed up to the cloud when connected, and will be exported for backup and extraction when the project is complete.
3. Data management roles and responsibilities: Information managers at UMN and UW-Madison will coordinate data management activities in consultation with the Director, Co-Directors, Managing Directors and the science team. All science team members will be required to use Fulcrum for lab and field data collection that will be synced on a common server. Project Leaders will develop data entry forms and approve data submissions. If any member of the Institute directors, project leaders, or information managers leaves the project, their data management responsibilities will be transferred to one of the remaining directors, either until the end of the project or a replacement is named.
4. Data dissemination methods and access: Leaf and plot hyperspectral data and any associated measurements/descriptions (e.g., species information, chemical/physiological measurements, metabolomics etc.) will be publicly searchable and available through EcoSIS.org, a NASA-supported repository hosted on Amazon Web Services. EcoSIS supplies DOIs for all contributions and is managed and curated by co-PI Townsend at UW-Madison. EcoSIS is backed up daily. Airborne imaging spectroscopic and LiDAR data collected by our project team will be publicly searchable and downloadable from NASA Distributed Active Archive Centers (DAACs). Image data collected by third parties (e.g., NASA, NEON) will be available through their portals. Plant growth data from experiments at Cedar Creek Ecosystem Science Reserve (CCESR) will be made publicly available via the Environmental Data Initiative (EDI) with links to the CDR LTER website, an NSFfunded resource for long-term public access and preservation. Data collected in Minnesota but not at CCESR will be stored on shared, locally managed servers at UMN and deposited into the UMN Libraries' Data Repository for the University of Minnesota (DRUM), an open access core-trust seal certified institutional data repository (more information on DRUM at 6. Data archiving below). High-throughput sequencing data of individual plants will be shared and distributed via the NCBI Sequence Read Archive (SRA). Phylogenetic datasets, including trees and analysis information, will be deposited to the OpenTree project, allowing for further annotation and sharing of the results. In addition, analysis files for publications, such as sequence alignments, climatic niche and metabolite data matrices, any other source data files, and various associated statistical analyses will be permanently stored in the Dryad Digital Repository. Computer code and software will be developed under the GPL (Gnu Public License) or similar certified open-source license (as defined by the Open Source Initiative). All code for processing of hyperspectral data (leaf measurements and image data) will be distributed through EcoSML.org, a NASA supported searchable open-source code repository linked to GitHub, GitLab and BitBucket that supplies DOIs for all contributions. EcoSML is managed and curated by Co-PI Townsend at UW-Madison. All code repositories will be actively maintained and updated by the authors after publication. In addition, the version used for the publication will be provided in the supplemental material of all publications produced. Tissue samples will be freely available for nonprofit research uses upon permission from the original tissue provider after completion of the project. Curriculum materials will be deposited in DRUM for public availability with the exception of K-12 classroom materials, which will be made publicly available on the CCESR website.
5. Data policies: Data access will be restricted to the project personnel during collection and analysis and released immediately upon publication of research findings. If data are archived prior to publication, the data will be embargoed until date of publication. Data will be released under a Creative Commons BY 2.0 License requesting attribution for reuse.
6. Data archiving: The archive repositories described above ensure long-term preservation and access. All software code hosted on GitHub will be archived in the GitHub Archive Program. Software code hosted on BitBucket will be submitted to the Software Heritage archive (softwareheritage.org). Digital data not already in one of these archives at project's end will be deposited in DRUM. All archived data will include the appropriate documentation, metadata, and code to facilitate reuse.

US National Science Foundation (NSF) Award Number: 2021898
Division of Biological Infrastructure (DBI) Biology Integration Institutes

Grant title
BII-Implementation: The causes and consequences of plant biodiversity across scales in a rapidly changing world

Hosting institution
University of Minnesota