Tremendous Task: Searching for code on GitHub with BigQuery and GHTorrent

Shane Mulligan · Oct. 3, 2019, 11 a.m.
Searching GitHub for regular expression matches in code is a tremendous task.

Prerequisites

- Set up the bq command: https://cloud.google.com/bigquery/docs/bq-command-line-tool
- Lots of money

Sample search

About US$5 per search. This is cheaper than searching all files.

shell variable      function
$query              a regular expression that searches the contents of files
$path_re            a regex that matches on the file path
$path_re_exclude    a regex that matches on the file path for pruning results
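As a rough illustration of how the three variables above could fit together, here is a minimal sketch that assembles a standard-SQL query string from them. It is not the original script. The helper name build_search_sql is hypothetical, the example values are made up, and the table and column names assume the public bigquery-public-data.github_repos dataset rather than whatever tables the original search used.

```shell
#!/bin/sh
# Example values only; replace with your own patterns.
query='TODO|FIXME'            # regex applied to file contents
path_re='\.py$'               # regex a file path must match
path_re_exclude='/vendor/'    # regex used to prune matching paths

# Hypothetical helper: prints a standard-SQL query built from the three
# variables. Assumes the public github_repos dataset (files + contents).
# All three variables must be non-empty and must not contain single quotes.
build_search_sql() {
    cat <<EOF
SELECT f.repo_name, f.path
FROM \`bigquery-public-data.github_repos.files\` AS f
JOIN \`bigquery-public-data.github_repos.contents\` AS c ON f.id = c.id
WHERE REGEXP_CONTAINS(f.path, r'${path_re}')
  AND NOT REGEXP_CONTAINS(f.path, r'${path_re_exclude}')
  AND REGEXP_CONTAINS(c.content, r'${query}')
EOF
}

# Print the query so it can be reviewed before spending money on it.
build_search_sql

# To estimate the bytes scanned without running (and paying for) the query:
#   bq query --use_legacy_sql=false --dry_run "$(build_search_sql)"
```

Running with --dry_run first is a cheap way to check the expected cost; drop the flag once the estimate looks acceptable.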