以下是使用dbplyr和bigrquery包的示例代码来获取在5-95分位数之间的均值、最大值、最小值和标准差。
library(dplyr)
library(dbplyr)
library(bigrquery)
con <- dbConnect(
bigrquery::bigquery(),
project = "my_project",
dataset = "my_dataset"
)
my_table <- tbl(con, "my_table")
my_table %>%
summarize(
Mean = mean(column_name),
Max = max(column_name),
Min = min(column_name),
SD = sd(column_name),
Q5 = quantile(column_name, 0.05),
Q95 = quantile(column_name, 0.95)
) %>%
collect()
请确保将“column_name”替换为您要分析的列的名称。