Category: GCP

  • How to Find What’s Driving BigQuery Bytes Billed

    How to Find What’s Driving BigQuery Bytes Billed

    Recently, I reduced BigQuery cost significantly by changing how we maintained a token index table used for keyword search. The original design updated the index during ingestion. Each small OCR batch triggered a MERGE into a chunk_terms table. That looked incremental, but it turned out to be expensive because BigQuery still needed to scan the…

  • Going Live on Google Cloud Marketplace

    A Technical Lead’s Experience As the technical lead for MOGOPLUS’s recent launch on Google Cloud Marketplace, I want to share our journey and key insights. MOGOPLUS provides insights from unstructured data, primarily serving the lending sector. Our decision to join the Google Cloud Marketplace stemmed from our existing relationship as a GCP customer and our…

  • BigQuery Bulk Insert using Python

    To insert multiple rows at once using DML in BigQuery, you can do something like this: The above snippet inserts multiple rows into the table (table with columns: id and createdAt) in one go. You can also see that values types are being supplied (id:STRING and createdAt:TIMESTAMP). Binding the values this way (using parameters) will…