Bar Charts
Bar Charts is a simple way to visualize grouped data through colorful
rectangles. In the world of text buffers and shell output, it is
surprising that Emacs has a library to draw colored rectangles:
chart
.
(require 'chart) (chart-bar-quickie 'vertical "Favorite Type of Movie" ;; Type & Title '("Comedy" "Action" "Romance" "Drama" "Sci-Fi") "Genre" ;; Keys & Label '(4 5 6 1 4) "People" ;; Values & Label )
As the screenshot shows above, it is not sophisticated enough to
warrant deep examination but still impressive within the realm of text
buffers. The context of being builtin leads to fascinating queries
about the Emacs interpreter such as memory-usage
.
If you want to try it out, you can use the command load-library
to
import chart
. It defines the following demo command for exploration:
chart-emacs-storage
- This charts the output of
memory-usage
chart-emacs-lists
- Like
chart-emacs-storage
, this counts buffers, frames, processes, faces and x-display within the Emacs session chart-rmail-from
- With
rmail
, it groups user sender occurrences by mail chart-space-usage
- After selecting a directory, it shows the disk usage by file type within said directory
chart-file-count
- After selecting a directory, it displays file type occurrences within the directory
chart-test-it-all
- A chart testing positive and negative values
Aside from those demo commands, how can we use it and make a chart?
Making A Chart
After some reading and exploration, the starting command or function
to understand this library is chart-bar-quickie
, the rest seem
internal. Although this function has a total of eight arguments which
two are optional, it encapsulates everything needed to make the chart
including labels and data rather than instantiating classes and
objects. A small snippet to elaborate:
(defun fn/chart-comparator-< (left right) "A sample comparator for the values. Each parameter is a key-value cos pair and should function as a comparator like `<', `=', or `>'." (pcase-let ((`(,left-key . ,left-value) left) ;; left is a cons of key and value (`(,right-key . ,right-value) right) ;; ditto with right ) (< left-value right-value))) (chart-bar-quickie 'vertical ;; Chart direction, either 'vertical or 'horizontal "Chart Title" ;; Chart title '("Alpha" "Beta" "Gamma" "Delta") ;; X-Axis values or keys "X Line" ;; X-Axis label '(4 3 2 1) ;; Y-Axis values "Y Line" ;; Y-Axis label ;; Optional 3 ;; Max value allowed, anything higher is not shown #'fn/chart-comparator-< ;; Sorting or ordering function )
Once evaluated, this creates a new buffer based on the title and
renders the bar chart there. The display itself is inert unlike
tabulated-list-mode
or any special buffer modes, so do not expect
special key bindings or commands that sort, zoom, refresh the chart
out of the box. Aside from tempering expectations, all the major (if
not all) functionality is within that function. (Sadly, looking any
deeper does not yield any more insights.) Lastly, the only caveat here
is that the size of the keys and values should be the same which is
natural as it matches each key with the corresponding value.
Demonstration
As an example, let us tackle a statistical query on a dataset. We will play with the Hair Eye Color dataset which looks like this:
"","Hair","Eye","Sex","Freq" "1","Black","Brown","Male",32 "2","Brown","Brown","Male",53 "3","Red","Brown","Male",10 "4","Blond","Brown","Male",3 "5","Black","Blue","Male",11
Before we query, let us import the data which is a good exercise for CSV parsing in Emacs:
(setq csv-dataset-file (expand-file-name "HairEyeColor.csv" "~/Downloads")) (defun read-lines (file) ;; Take your pick: custom, find-file, f-read-text (with-temp-buffer (insert-file-contents-literally file) (split-string (decode-coding-region (point-min) (point-max) 'utf-8 t) "\n" t))) (defun read-csv-field (field) ;; Remove the pesky enclosed double quoting (cond ((zerop (length field)) "") ((and (string-equal (substring-no-properties field 0 1) "\"") (string-equal (substring-no-properties field -1) "\"")) (substring-no-properties field 1 -1)) (t (string-to-number field)))) (defun read-csv-line (line) (let ((raw-fields ;; This assumes no rogue commas need escaping (split-string line ","))) (mapcar #'read-csv-field raw-fields))) (defun read-csv-file (file) (mapcar #'read-csv-line (read-lines file))) (setq dataset (read-csv-file csv-dataset-file)) ;; Generated output (("" "Hair" "Eye" "Sex" "Freq") ("1" "Black" "Brown" "Male" 32) ("2" "Brown" "Brown" "Male" 53) ("3" "Red" "Brown" "Male" 10) ("4" "Blond" "Brown" "Male" 3))
Libraries exist for this purpose such as csv
, el-csv
or
parse-csv
and utilities like f
or parsec
but doing it yourself
is an occasional opportunity to review the basics. With that handled,
what can we ask of the data? What is the frequency of eye colors?
Before that, we need to group the data:
(setq headers (car dataset) ;; Only mentioned, not needed records (cdr dataset)) (defun group-by (f xs) ;; A quick write on -group-by (let ((groups (list))) (mapc (lambda (x) (let* ((key (funcall f x)) (key-group (assoc key groups))) (unless key-group (push (cons key (list)) groups) (setq key-group (assoc key groups))) (setcdr key-group (cons x (cdr key-group))))) xs) groups)) (defun group-records-by-eye-color (records) (let* ((raw-eye-groups (group-by (apply-partially #'nth 2) records)) (eye-groups (mapcar (lambda (eye-group) (pcase-let ((`(,eye-color . ,eye-records) eye-group)) (let ((eye-frequencies ;; Extract frequencies and collect it (mapcar (apply-partially #'nth 4) eye-records))) (cons eye-color (apply #'+ eye-frequencies) ;; Shiv for sum )))) raw-eye-groups))) eye-groups)) (setq eye-color-groups (group-records-by-eye-color records)) ;; Generated output (("Green" . 64) ("Hazel" . 93) ("Blue" . 215) ("Brown" . 220))
After writing the extraction, getting the bar chart is straightforward.
(chart-bar-quickie 'horizontal "Eye Colors" (mapcar #'car eye-color-groups) "Colors" (mapcar #'cdr eye-color-groups) "Frequency")
Neat but let's arrange it by descending order:
(defun on (f op) ;; Haskell's on operator (lexical-let ((f f) ;; Sad that parameters aren't lexically scoped here (op op)) (lambda (left right) (funcall op (funcall f left) (funcall f right))))) (chart-bar-quickie 'horizontal "Eye Colors - Descending" (mapcar #'car eye-color-groups) "Colors" (mapcar #'cdr eye-color-groups) "Frequency" nil (on #'cdr #'>) ;; A comparator lambda also works but done for variety )
Looking at it, brown and blue eye colors are quite frequent. Nothing groundbreaking or epic. How about applying the same process with hair color? We get the following chart:
With hair color as the variable, brown and blonds are many. Nothing fancy. We could change the variable into gender but nothing more would be gained. The real question for any visualization is what does it mean? Rather, what insights can we interpret from the data? Since it is a sample, it might not mean anything nor does it have to. This is a library exploration, not a statistical lesson that is beyond scope.
Conclusion
Strangely, that is all there is to it: a neat and easy builtin bar chart library. Since it is not pixel precise, it does not compete with the modern visualizations tools or languages such as Python, R or Haskell although it does not have to. It simply is nice to have and fun to know.
Interestingly, if bar charts can be made with ASCII, how about pies or bubbles. Can Emacs be a modern visualization tool?