LoaderQuickSetup¶
-
class
google_pandas_load.LoaderQuickSetup(project_id=None, dataset_id=None, bucket_name=None, gs_dir_path_in_bucket=None, credentials=None, local_dir_path=None, generated_data_name_prefix=None, max_concurrent_google_jobs=40, use_wildcard=True, compress=True, separator='|', chunk_size=268435456, logger=<Logger LoaderQuickSetup (DEBUG)>)¶ Bases:
google_pandas_load.LoaderThe purpose of this class is to quickly set up a loader.
An instance of LoaderQuickSetup is simply an instance of the base class built with the following arguments:
bq_client=bq_client dataset_ref=dataset_ref bucket=bucket gs_dir_path_in_bucket=gs_dir_path_in_bucket local_dir_path=local_dir_path generated_data_name_prefix=generated_data_name_prefix max_concurrent_google_jobs=max_concurrent_google_jobs use_wildcard=use_wildcard compress=compress separator=separator chunk_size=chunk_size logger=logger
where
bq_client = google.cloud.bigquery.Client( project=project_id, credentials=credentials) dataset_ref = google.cloud.bigquery.dataset.DatasetReference( project=project_id, dataset_id=dataset_id) gs_client = google.cloud.storage.Client( project=project_id, credentials=credentials) bucket = google.cloud.storage.bucket.Bucket( client=gs_client, name=bucket_name)
- Parameters
project_id (str, optional) – The project id.
dataset_id (str, optional) – The dataset id.
bucket_name (str, optional) – The bucket name.
gs_dir_path_in_bucket (str, optional) – See base class.
credentials (google.auth.credentials.Credentials) – Credentials used to build the bq_client and the bucket. If not passed, falls back to the default inferred from the environment.
local_dir_path (str, optional) – See base class.
generated_data_name_prefix (str, optional) – See base class.
max_concurrent_google_jobs (int, optional) – See base class. Default value is 40 while the default value for the base class is 10. The first value is intended for a notebook environment whereas the second one for a production environment. In the latter case the amount of google computing resources is usually shared by several programs running in production.
use_wildcard (bool, optional) – See base class.
compress (bool, optional) – See base class.
separator (str, optional) – See base class.
chunk_size (int, optional) – See base class.
logger (logging.Logger, optional) – See base class. Default value is a logger called LoaderQuickSetup. Contrary to the default base class logger, it is set to not propagate its log records to its logger ancestors and it is equipped with an handler displaying the log records to the console. This is, for instance, convenient when working with a notebook.
-
property
bq_client¶ See the bq_client parameter of the base class.
- Type
google.cloud.bigquery.client.Client
-
property
bucket¶ See the bucket parameter of the base class.
- Type
google.cloud.storage.bucket.Bucket
-
property
dataset_ref¶ See the dataset_ref parameter of the base class.
- Type
google.cloud.bigquery.dataset.DatasetReference
-
property
gs_client¶ The Storage client used to create the bucket.
- Type
google.cloud.storage.client.Client