...

Clients who store the documents in their own Google Cloud Platform project must follow the documented rules and naming patterns for the BigQuery resources: https://boxalino.atlassian.net/wiki/spaces/BPKB/pages/415432770/Load+Request#Using-private-GCP-resources-for-DI . In such scenarios, only step #4 is of interest.

Transform your data source

The Boxalino Data Structure is publicly available in our git repository: https://github.com/boxalino/data-integration-doc-schema

You can use the repository to identify the data formats & data elements expected for each document.

You can also validate your transformation and test a load in a BigQuery table.
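
As a rough illustration of such a validation, the sketch below checks newline-delimited JSON rows against a BigQuery-style field list (name, type, mode) before any load is attempted. It assumes the schema files in the repository follow that BigQuery JSON schema layout and that the transformed documents are exported as newline-delimited JSON. EXAMPLE_SCHEMA, validate_row and validate_jsonl are hypothetical names used for illustration only and are not part of the Boxalino tooling.

```python
import json

# Hypothetical BigQuery-style schema fragment (assumption for illustration);
# the real field definitions live in the data-integration-doc-schema repository.
EXAMPLE_SCHEMA = [
    {"name": "id", "type": "STRING", "mode": "REQUIRED"},
    {"name": "stock", "type": "INTEGER", "mode": "NULLABLE"},
    {"name": "tags", "type": "STRING", "mode": "REPEATED"},
]

# Python types accepted for each schema type covered by this sketch.
PYTHON_TYPES = {
    "STRING": str,
    "INTEGER": int,
    "FLOAT": (int, float),
    "BOOLEAN": bool,
    "RECORD": dict,
}


def validate_row(row, schema, path=""):
    """Return a list of human-readable problems found in one row."""
    if not isinstance(row, dict):
        return [f"{path or 'row'}: expected a JSON object"]
    errors = []
    known = {field["name"] for field in schema}
    for key in row:
        if key not in known:
            errors.append(f"{path}{key}: not in schema")
    for field in schema:
        name = field["name"]
        mode = field.get("mode", "NULLABLE")
        value = row.get(name)
        if value is None:
            if mode == "REQUIRED":
                errors.append(f"{path}{name}: required field is missing")
            continue
        if mode == "REPEATED":
            if not isinstance(value, list):
                errors.append(f"{path}{name}: expected a list")
                continue
            items = value
        else:
            items = [value]
        expected = PYTHON_TYPES.get(field["type"])
        if expected is None:
            continue  # type not covered by this sketch
        for item in items:
            # bool is a subclass of int in Python, so reject it for numeric types
            is_stray_bool = isinstance(item, bool) and field["type"] != "BOOLEAN"
            if is_stray_bool or not isinstance(item, expected):
                errors.append(f"{path}{name}: expected {field['type']}")
            elif field["type"] == "RECORD":
                nested = field.get("fields", [])
                errors.extend(validate_row(item, nested, f"{path}{name}."))
    return errors


def validate_jsonl(text, schema):
    """Validate newline-delimited JSON; return {line_number: [problems]}."""
    report = {}
    for number, line in enumerate(text.splitlines(), start=1):
        if not line.strip():
            continue
        try:
            row = json.loads(line)
        except json.JSONDecodeError as exc:
            report[number] = [f"invalid JSON: {exc.msg}"]
            continue
        problems = validate_row(row, schema)
        if problems:
            report[number] = problems
    return report
```

Calling validate_jsonl(exported_text, schema) with the content of an export file and a schema loaded from the repository returns an empty dictionary when every row passes these checks. A local check of this kind only catches structural mistakes early; a test load in a BigQuery table remains the authoritative validation.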

For certain headless CMSs, Boxalino has designed a Transformer service (see Transformer).

Loading content to GCS and BigQuery

...