betydata

Public Data from the BETYdb Database

Author

David LeBauer and Akash B V

Published

March 1, 2026

betydata is an R data package providing offline access to public data from BETYdb (Biofuel Ecophysiological Traits and Yields database). It enables reproducible analyses of plant traits, crop yields, and supporting metadata without requiring database connectivity.

At a Glance

Table 1: Dataset summary
Metric Value
Total observations 43,532
Unique traits 298
Plant species 70,741
Research sites 23173
Literature citations 985
Tables 16

Installation

# install.packages("remotes")
remotes::install_github("PecanProject/betydata")

Quick Start

library(betydata)
library(dplyr)

# The primary table -- key columns first, IDs last
traitsview
# Bioenergy crop observations
bioenergy_genera <- c("Miscanthus", "Panicum", "Populus", "Salix", "Saccharum")
traitsview |>
  filter(genus %in% bioenergy_genera) |>
  count(genus, sort = TRUE)

Guides

Guide Description
Getting Started Package overview, data model, and key concepts
Common Analyses Practical analysis patterns with dplyr
PFTs & Priors Plant Functional Types and Bayesian prior distributions
Manuscript Reproduction Reproduce key analyses from LeBauer et al. (2018)

Data License

Data: ODC-By-1.0 | Code: BSD-3-Clause

Citation

LeBauer, D. S., et al. (2018). BETYdb: a yield, trait, and ecosystem service database applied to second-generation bioenergy feedstock production. GCB Bioenergy. doi:10.1111/gcbb.12420