Skip to contents

All functions

build_corpus_index()
Build a Parquet index for fast ID lookups in a parquet corpus
compatibility_report()
Render and open the compatibility report
extract_doi()
Extract DOIs or Components from Character Vectors
id_block()
Compute ID block from OpenAlex IDs
infer_json_schema()
Infer unified JSON schema using DuckDB
jq_execute()
Execute a jq transformation from an OpenAlex-style JSON to JSONL
lookup_by_id()
Look up records by ID using a pre-built index
opt_filter_names()
Get available filter names from OpenAlex API
opt_select_fields()
Get available select fields from OpenAlex API
prepare_snapshot()
Prepare a directory for OpenAlex snapshot management
pro_download_content()
Download full-text PDFs or TEI XML for OpenAlex works
pro_fetch()
Fetch and convert OpenAlex data
pro_query()
Build an OpenAlex request (httr2)
pro_rate_limit_status()
Check OpenAlex rate limit status
pro_request()
Fetch works from OpenAlex
pro_request_jsonl()
Convert JSON files to jsonl files
pro_request_jsonl_parquet()
Convert JSON files to Apache Parquet files
pro_validate_credentials()
Validate OpenAlex credentials
read_corpus()
Read corpus from Parquet Dataset
sample_parquet_n()
Sample rows from Parquet files using DuckDB reservoir sampling
snapshot_to_parquet()
Convert OA snapshot to Parquet format