Package index
-
build_corpus_index() - Build a Parquet index for fast ID lookups in a parquet corpus
-
compatibility_report() - Render and open the compatibility report
-
extract_doi() - Extract DOIs or Components from Character Vectors
-
id_block() - Compute ID block from OpenAlex IDs
-
infer_json_schema() - Infer unified JSON schema using DuckDB
-
jq_execute() - Execute a jq transformation from an OpenAlex-style JSON to JSONL
-
lookup_by_id() - Look up records by ID using a pre-built index
-
opt_filter_names() - Get available filter names from OpenAlex API
-
opt_select_fields() - Get available select fields from OpenAlex API
-
prepare_snapshot() - Prepare a directory for OpenAlex snapshot management
-
pro_download_content() - Download full-text PDFs or TEI XML for OpenAlex works
-
pro_fetch() - Fetch and convert OpenAlex data
-
pro_query() - Build an OpenAlex request (httr2)
-
pro_rate_limit_status() - Check OpenAlex rate limit status
-
pro_request() - Fetch works from OpenAlex
-
pro_request_jsonl() - Convert JSON files to jsonl files
-
pro_request_jsonl_parquet() - Convert JSON files to Apache Parquet files
-
pro_validate_credentials() - Validate OpenAlex credentials
-
read_corpus() - Read corpus from Parquet Dataset
-
sample_parquet_n() - Sample rows from Parquet files using DuckDB reservoir sampling
-
snapshot_to_parquet() - Convert OA snapshot to Parquet format