Adds filters missing in GraphQL. Adds graphs missings in SQL. The one language to rule them all.
Overview
Kusto is named after pioneering Oceanographer Jacque Custou (pronounced “Kusto”). Like the language, he dove deep into a vast ocean for new insights.
DOCS:
NOTE: Content here are my personal opinions, and not intended to represent any employer (past or present). “PROTIP:” here highlight information I haven’t seen elsewhere on the internet because it is hard-won, little-know but significant facts based on my personal research and experience.
Compared to SQL, KQL is less verbose but is proprietary to Microsoft.
The language is read-only, of up to 5 GB.
KQL is used in several Azure products, include the ADX managed SaaS service (GA 2019) for big data exploration in NRT (Near Real Time):
References:
-
2.45 hr VIDEO COURSE: Exploring Data in Microsoft Azure Using Kusto Query Language and Azure Data Explorer by Neeraj Kumar (@mstechtrainings) makes use of NOAA’s Storm Events Database.
- 4h VIDEO COURSE: How to Start with Microsoft Azure Data Explorer (ADX) 29 Jun 2020 by Xavier Morera
- https://www.pluralsight.com/partners/microsoft/azure-data-explorer
-
https://docs.microsoft.com/en-us/azure/data-explorer/security-baseline
-
This article on KQL is one of Neeraj Kumar’s excellent series on data science using Azure.
-
https://docs.microsoft.com/en-us/azure/data-explorer/kusto/query/best-practices
- https://github.com/Azure/Azure-Security-Center/tree/master/Kusto/Azure%20Resource%20Graph
Example:
This example of a KQL query draws data from the Heartbeat dataset. rendered as a timechart:
Hands-on:
-
Open demo data from Microsoft:
https://aka.ms/LADemo which resolves to:
https://portal.loganalytics.io/demo -
Click the book icon at the right and select “Online course” for:
4 hr VIDEO COURSE: Kusto Query Language (KQL) from scratch by Robert Cain, who also has an Advanced course.
-
Click the book icon at the right and select “Language Reference” for:
https://docs.microsoft.com/en-us/azure/data-explorer/kusto/query/
Observe that KQL is part of Azure Data Explorer.
- Click “Query explorer” tab at the right.
- Expand “Saved Queries”
- Double-click on “Pluralsight” to expand the category.
-
Click to open “m2-table-80-percent”.
OBSERVE: Clicking completely replaces the existing KQL entry, without needing to clear it first.
// precede all comments in code.
-
Highlight a query, click blue “Run” or hold down Shift and press Enter.
CAUTION: “No result” will be returned if there is no data.
“Perf” is the table name to search in. This table has these fields visible:
TenantId Computer ObjectName ObjectName CounterName InstanceName -
To see more columns, drag the horizonal bar or on the Touchpad move two fingers to the left:
CounterValue CounterPath StandardDeviation Type _ResourceID TenantID SourceSystem MG -
PROTIP: The full list of columns for this and all other tables is:
https://docs.microsoft.com/en-us/azure/azure-monitor/reference/tables/perf
-
Set the Time Frame or set in the script text:
| where TimeGenerated >= ago(1h)
PROTIP: Several where statements can be stacked.
Operators
-
For operators, click on the KQL query text area and press command+Enter:
- where - filter
- count
- extend - creates a calculated column in the result set (before project)
- join
- limit
- lookup
- order
- project - select a subset of columns (instead of all columns from table)
- project-away - remove column
-
To scroll for more, mouse over the list and slide two-fingers on the Mac Touchpad:
- sort
- summarize
-
search
- distinct - to not repeat values shown
- make-series
- mv-apply
- mv-expand
- take 5 // return 5 records
- top 20 // rows
- top-nested
…
The Pluralsight video covers search, where, take, count, summarize, extend, project, distinct.
Examples to limit too much being returned (and wasting time):
| search kind=case_sensitive "memory"
| search in (Perf, Event, Alert) "Contoso"
-
Use colon to search text wildcard:
| search CounterName:"MBytes"
-
Limit column:
| search * starswith "Bytes"
| search * endswith "Bytes"
| search "Free*Bytes" // Any that Begins with free or ends with bytes
| search InstanceName matches regex "[A-Z]:*"
-
Click “m3-demo-scalar” explained by this VIDEO covering Scalar Operators:
print, now() UTC, ago(-7d), sort by asc, extract, parse, datetime, Timespan Artithmetic, startof…, endof…, between, todynamic, format_datetime, format_timespan, datetime_aart, case, iif, isempty/isnull, split, String Operators, strcat
-
Click “m4-demo-advanced-aggregations” explained by this VIDEO
- summarize arg_max/arg_min( column ),
- makelist - flaten a hierarchy to a JSON array, allowing dup. values
- makeset - flaten a hierarchy to a JSON array, removing dup. values
For a list of PCs with low disk space:
Perf | where CounterName == "% Free Space" | and CounterValue <= 30 | summarize Computers = makeset(Computer)
- mvexpand, percentiles, dcount (distinct count, accuracy), dcountif, countif, pivot, top-nested, max/min, sum/sumif, any
Datasets
-
Click “m5-demo-working-with-datasets” explained by this VIDEO
- let, join (tables), union (combine) with source, kind=outer
- datatable, prev/next, toscalar, row_cumsum, materialize
Time Series
-
Click “m6-demo-time-series” explained by this VIDEO
Machine Learning
-
Click “m7-data-machine-learning” explained by this VIDEO Machine Learning
- evaulate basket(threshold) - for the most frequently appearing combination of attributes, given the threshold for minimum frequency (default 0.05 or 5%)
- autocluster
- evaulate diffpatterns(EventLevelName, ‘Error’, ‘Warning’) // splits dataset to identify differences as “Error” or “Warning”. Use iif to flag metrics before and after the incident.
- reduce by Computer with threshold = 0.6 // to determine pattern, with default threshold of 0.1.
Exporting data to CSV
-
Click “m8-exporting-data” explained by this VIDEO
- To export to CSV file, run query and click the export icon.
- Select Export to CSV - all columns or display columns.
- In the pop-up at the bottom, click Save As.
-
Specify the folder and file name.
Run KSL in PowerBI Desktop
- Copy the Query to your Clipboard.
- Download and install PowerBI Desktop from https://powerbi.microsoft.com/desktop
- Open PowerBI
- In Home group, Get Data - Blank Query
- Advanced Editor
- Paste the query (command+V). Done runs the query.
- Close and apply changes.
- Create visualizations, etc.
More examples
Events | where StartTime >= datetime(2018-11-01) and StartTime < datetime(2018-12-01) | where State == "FLORIDA" | count
### KQL in Data Explorer
-
VIDEO course Microsoft Azure Data Explorer - Advanced KQL by Robert Cain.
-
Download and expand microsoft-azure-data-explorer-advanced-query-capabilities.zip to view folder module-05-performing-diagnostic-and-root-cause-analysis.
-
??? Load into Azure
covers functions,
inline Python & R code (converted to KQL string by highlighting then Ctrl+K & Ctrl+S).
Analyze data using geospatial analysis,
Root Cause Analysis Diagnostics
clustering algorithms, time series analysis.
Exploring data using Kusto
-
Exploring Data in Microsoft Azure Using Kusto Query Language and Azure Data Explorer, download exercise file microsoft-azure-data-exploring.zip (to your Downloads folder) and unzip. In folder coursfiles, ??
JMESPath
The https://github.com/jmespath/jmespath.terminal Python package offers an interactive environment to work with queries. Data is piped as input, and then queries are written and run in the editor:
pip install jmespath-terminal az vm list --output json | jpterm
From: https://docs.microsoft.com/en-us/cli/azure/query-azure-cli
KQL References
- https://docs.microsoft.com/en-us/sharepoint/dev/general-development/keyword-query-language-kql-syntax-reference
CxE KQL hands-on lab exercises videos:
-
part 1 of 3 by Ofer Shezaf [Deck]
More on Azure
This is one of a series on Azure:
- DevOps_2.0
- ci-cd (Continuous Integration and Continuous Delivery)
- User Stories for DevOps
- Git and GitHub vs File Archival
- Git Commands and Statuses
- Git Commit, Tag, Push
- Git Utilities
- Data Security GitHub
- GitHub API
- Choices for DevOps Technologies
- Pulumi Infrastructure as Code (IaC)
- Java DevOps Workflow
- AWS DevOps (CodeCommit, CodePipeline, CodeDeploy)
- AWS server deployment options
- Cloud services comparisons (across vendors)
- Cloud regions (across vendors)
- Azure Cloud Onramp (Subscriptions, Portal GUI, CLI)
- Azure Certifications
- Azure Cloud Powershell
- Bash Windows using Microsoft’s WSL (Windows Subsystem for Linux)
- Azure Networking
- Azure Storage
- Azure Compute
- Digital Ocean
- Packer automation to build Vagrant images
- Terraform multi-cloud provisioning automation
-
Hashicorp Vault and Consul to generate and hold secrets
- Powershell Ecosystem
- Powershell on MacOS
- Jenkins Server Setup
- Jenkins Plug-ins
- Jenkins Freestyle jobs
- Docker (Glossary, Ecosystem, Certification)
- Make Makefile for Docker
- Docker Setup and run Bash shell script
- Bash coding
- Docker Setup
- Dockerize apps
- Ansible
- Kubernetes Operators
- Threat Modeling
- API Management Microsoft
- Scenarios for load
- Chaos Engineering