Podcast
Questions and Answers
What should be done after identifying a cancelled client in the BigQuery system?
What should be done after identifying a cancelled client in the BigQuery system?
What is the purpose of highlighting data in red in the tracker data file?
What is the purpose of highlighting data in red in the tracker data file?
When checking client IDs, what should be done if a client has re-signed after cancellation?
When checking client IDs, what should be done if a client has re-signed after cancellation?
Which of the following should be checked alongside the new customer log?
Which of the following should be checked alongside the new customer log?
Signup and view all the answers
What indicates that a client is no longer active?
What indicates that a client is no longer active?
Signup and view all the answers
What should be done with clients identified as 'N.A.' in the new customer log?
What should be done with clients identified as 'N.A.' in the new customer log?
Signup and view all the answers
Which software tool is used to verify the presence of client IDs?
Which software tool is used to verify the presence of client IDs?
Signup and view all the answers
Which behavior might indicate a client is trying to return after cancellation?
Which behavior might indicate a client is trying to return after cancellation?
Signup and view all the answers
What is the first step in cleaning up the client queries?
What is the first step in cleaning up the client queries?
Signup and view all the answers
Which task was deemed a priority when running BigQuery?
Which task was deemed a priority when running BigQuery?
Signup and view all the answers
What is the first file needed for the process described?
What is the first file needed for the process described?
Signup and view all the answers
Which two files need to be opened simultaneously?
Which two files need to be opened simultaneously?
Signup and view all the answers
What is the purpose of cleaning the BigQuery data?
What is the purpose of cleaning the BigQuery data?
Signup and view all the answers
What is the significance of copying client IDs from the bottom of the list?
What is the significance of copying client IDs from the bottom of the list?
Signup and view all the answers
What should be done before addressing any questions?
What should be done before addressing any questions?
Signup and view all the answers
What is the role of the Client Intake Form in the cleaning process?
What is the role of the Client Intake Form in the cleaning process?
Signup and view all the answers
What action should be taken after the recording?
What action should be taken after the recording?
Signup and view all the answers
How should the screen be shared during the process?
How should the screen be shared during the process?
Signup and view all the answers
Why was there a misunderstanding regarding the microphone?
Why was there a misunderstanding regarding the microphone?
Signup and view all the answers
What should be done if there is confusion during the discussion?
What should be done if there is confusion during the discussion?
Signup and view all the answers
What is the purpose of using VLOOKUP in this context?
What is the purpose of using VLOOKUP in this context?
Signup and view all the answers
What is the first step suggested for cleaning up the client ID data?
What is the first step suggested for cleaning up the client ID data?
Signup and view all the answers
After copying a client ID, what is the next suggested action?
After copying a client ID, what is the next suggested action?
Signup and view all the answers
What should be done after processing the canceled client IDs?
What should be done after processing the canceled client IDs?
Signup and view all the answers
Which method is suggested for removing entries in BigQuery?
Which method is suggested for removing entries in BigQuery?
Signup and view all the answers
What caution is advised when dealing with client names that are similar?
What caution is advised when dealing with client names that are similar?
Signup and view all the answers
What function is specifically mentioned to handle separating client IDs from other data?
What function is specifically mentioned to handle separating client IDs from other data?
Signup and view all the answers
What indicates that a client has been deleted from BigQuery?
What indicates that a client has been deleted from BigQuery?
Signup and view all the answers
How should unnecessary words or data be handled according to the instructions?
How should unnecessary words or data be handled according to the instructions?
Signup and view all the answers
Which operation is NOT suggested after identifying canceled clients?
Which operation is NOT suggested after identifying canceled clients?
Signup and view all the answers
What is the first action to be undertaken in the cleaning process?
What is the first action to be undertaken in the cleaning process?
Signup and view all the answers
Which two files are essential to open simultaneously during the cleaning process?
Which two files are essential to open simultaneously during the cleaning process?
Signup and view all the answers
What should be done prior to addressing any questions during the cleaning process?
What should be done prior to addressing any questions during the cleaning process?
Signup and view all the answers
Why is it necessary to check the Client Intake Form during the cleaning process?
Why is it necessary to check the Client Intake Form during the cleaning process?
Signup and view all the answers
During the process, how should the screen be shared?
During the process, how should the screen be shared?
Signup and view all the answers
What should be done to the client IDs in the BigQuery after they have been identified as cancelled?
What should be done to the client IDs in the BigQuery after they have been identified as cancelled?
Signup and view all the answers
Which is the correct function to use for checking whether client IDs are in the canceled clients list?
Which is the correct function to use for checking whether client IDs are in the canceled clients list?
Signup and view all the answers
What should be done after deleting client queries from BigQuery?
What should be done after deleting client queries from BigQuery?
Signup and view all the answers
What is the main step involved in separating client IDs from other data in BigQuery?
What is the main step involved in separating client IDs from other data in BigQuery?
Signup and view all the answers
When identifying the client query entries, what is essential to avoid confusion with similar names?
When identifying the client query entries, what is essential to avoid confusion with similar names?
Signup and view all the answers
What should be done first when updating BigQuery?
What should be done first when updating BigQuery?
Signup and view all the answers
What action is required after replacing a client's query in BigQuery?
What action is required after replacing a client's query in BigQuery?
Signup and view all the answers
What should be done if a new client does not have an existing query in BigQuery?
What should be done if a new client does not have an existing query in BigQuery?
Signup and view all the answers
When should the 'Select to Union All' command be used during the updating process?
When should the 'Select to Union All' command be used during the updating process?
Signup and view all the answers
What characteristic distinguishes rows that need edits in the tracker data?
What characteristic distinguishes rows that need edits in the tracker data?
Signup and view all the answers
What is the final step to ensure changes are saved in BigQuery?
What is the final step to ensure changes are saved in BigQuery?
Signup and view all the answers
If a client entry is already present in the all clients latest query and there's a new query for that client, what should be done?
If a client entry is already present in the all clients latest query and there's a new query for that client, what should be done?
Signup and view all the answers
What is the purpose of highlighting rows in yellow in the tracker data?
What is the purpose of highlighting rows in yellow in the tracker data?
Signup and view all the answers
What should be done with entries highlighted in yellow in the query?
What should be done with entries highlighted in yellow in the query?
Signup and view all the answers
How are queries marked for removal indicated?
How are queries marked for removal indicated?
Signup and view all the answers
What should be confirmed to check if new data has arrived?
What should be confirmed to check if new data has arrived?
Signup and view all the answers
What is the purpose of using chat GPT in the documentation process?
What is the purpose of using chat GPT in the documentation process?
Signup and view all the answers
What is the recommended method for handling the transcript before documentation?
What is the recommended method for handling the transcript before documentation?
Signup and view all the answers
What is the first step to take in the VM Instances window?
What is the first step to take in the VM Instances window?
Signup and view all the answers
How long does the script 'load_json_to_bq.sh' typically take to finish running?
How long does the script 'load_json_to_bq.sh' typically take to finish running?
Signup and view all the answers
What should be done after running the SQL query in BigQuery?
What should be done after running the SQL query in BigQuery?
Signup and view all the answers
Which command is NOT used in the SSH terminal after navigating to the ai-enric directory?
Which command is NOT used in the SSH terminal after navigating to the ai-enric directory?
Signup and view all the answers
What should be done to avoid data duplication or incorrect entries in the Tracker Data Spreadsheet?
What should be done to avoid data duplication or incorrect entries in the Tracker Data Spreadsheet?
Signup and view all the answers
What action must be taken to run the function in Cloud Functions?
What action must be taken to run the function in Cloud Functions?
Signup and view all the answers
Which command creates a new screen session in the SSH terminal?
Which command creates a new screen session in the SSH terminal?
Signup and view all the answers
After closing SSH, what command should be re-entered to monitor logs?
After closing SSH, what command should be re-entered to monitor logs?
Signup and view all the answers
How should the output of the AI matching script be described in terms of performance?
How should the output of the AI matching script be described in terms of performance?
Signup and view all the answers
What action follows clearing the data in the Tracker Data Spreadsheet?
What action follows clearing the data in the Tracker Data Spreadsheet?
Signup and view all the answers
Study Notes
Data Cleaning Procedure for BigQuery
- Tools Required: BigQuery, traffic tracker data file, client intake form file
- Initial Steps: Open BigQuery, All Clients Latest Files. Clean BigQuery
- Simultaneous Viewing: View BigQuery and Client Intake Form side-by-side.
- Client Status Check: Use the Client Intake Form to identify cancelled or active clients.
- Copy Client IDs: Copy all client IDs from the bottom of the All Clients Latest BigQuery list.
- Create New Sheet: Create a new sheet (e.g., December 5, BQ Client IDs) to store the client IDs.
- Extract Client IDs: Paste client IDs and separate them from extraneous text using data tools (e.g., 'Split text to columns').
- Sort Client IDs: Sort the client IDs alphabetically in a column.
- Identify Cancelled Clients: Cross-reference client IDs in the sorted list with the “Canceled Clients” tab to find canceled client IDs.
- Remove Cancelled Clients' Queries: Remove IDs marked as cancelled from the BigQuery, avoiding deletion of active client IDs.
- Transfer Queries to Tracker Data: Copy and paste client IDs found in the canceled client list into the Tracker data, replacing the corresponding query.
- Delete from BigQuery: Delete client queries from BigQuery linked to canceled IDs ensuring no active client is removed.
Additional Considerations
- Highlight Canceled Clients: In the tracker data file, highlight rows of canceled clients in red, providing visual cues to the team.
- Handle Duplicate Names: Be mindful of duplicate names when removing clients from BigQuery. Ensure you do not remove queries for clients with the same name but different IDs.
- Old Clients: Delete queries for clients who are no longer active but were in the BigQuery query.
- New Customers: Cross-reference and remove client IDs not present in the new customer log for potentially inactive clients.
- Search and Creation Log Check: Check if client IDs are also in the search and creation log; if present, ensure queries are not deleted, and highlight rows.
- Active Client Verification: Add or update client IDs in new customer log that may have been inaccurately marked as inactive to keep active clients in the query.
Finalizing the Process
-
Check for Hidden Rows: Always check for hidden rows in Tracker data files.
-
Removal of N/A clients: Remove client IDs that are not present in the new customer or search and curation logs, indicating potentially inactive clients.
-
Highlighting Prioritization: Prioritize highlighting removed IDs in red for visibility and communication across the tracker team.
-
Ensure Completeness: Review and verify that all necessary steps are taken before running the BigQuery query to avoid errors.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.
Description
This quiz covers the essential steps for cleaning and organizing client data using BigQuery.