.vimrc Scraping Data

Methodology

I wrote a CasperJS-driven scraper to download .gvimrc or .vimrc files from paginated github.com search results. I parsed those files into SQLite3 tables with ActiveRecord, modeled as User :has_many Lines. After cleaning up odd characters and whitespace, I explored the data using Ruby via irb and the Firefox SQLite Manager.
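The parse-and-clean step could look roughly like the sketch below. It is not the original code: it uses plain Ruby Structs in place of the ActiveRecord models, and the names VimrcUser, VimrcLine, clean_line and parse_vimrc are hypothetical. The exact cleanup rules are also an assumption (drop invalid bytes and non-printable characters, then trim whitespace).

```ruby
# Plain-Ruby stand-ins for the two tables (assumption: the original
# used ActiveRecord models backed by SQLite3, not Structs).
VimrcUser = Struct.new(:name, :lines)
VimrcLine = Struct.new(:number, :text)

# Remove invalid bytes and non-printable characters, then trim whitespace.
def clean_line(raw)
  raw.scrub("").gsub(/[^[:print:]\t]/, "").strip
end

# Build one user record whose lines mirror a User :has_many Lines relation.
def parse_vimrc(name, contents)
  lines = contents.each_line.with_index(1).map do |raw, number|
    VimrcLine.new(number, clean_line(raw))
  end
  VimrcUser.new(name, lines)
end

user = parse_vimrc("example_user", "set nocompatible  \r\n\n\" a comment\n")
user.lines.each { |line| puts "#{line.number}: #{line.text.inspect}" }
```

Blank lines are kept as empty records so that they can still be counted later.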

Summary Statistics

Overview
  Files Scraped: 777
  Lines Parsed: 185,582
  Max / Min Lines: 1,339 / 1 (the single-line file is an include directive)
  Mean Lines / File: 240

Line Breakdown
  Cruft
    Comments: 44,102
    Blank: 34,367
  Settings
    set: 22,175
    let: 6,078
  Mappings
    map: 4,757
    nnoremap: 3,189
    nmap: 3,077
    inoremap: 912
    noremap: 852
    imap: 584
  Commands
    autocmd: 2,965
    au: 1,491
  Miscellaneous
    if: 1,979
    endif: 1,946
    Pathogen: 1,525
    Bundle: 1,230
    syntax: 623
    color | colorscheme: 287
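A breakdown like this can be produced by classifying each cleaned line by its leading command word and counting the results. The sketch below is one way to do that, not the original queries: classify and tally_lines are hypothetical names, and rows such as Pathogen or color | colorscheme were probably matched with other patterns than a simple first-word test.

```ruby
# Classify one .vimrc line as :blank, :comment, or by its first command word.
def classify(line)
  text = line.strip
  return :blank if text.empty?
  return :comment if text.start_with?('"')
  word = text[/\A[A-Za-z]+/]
  word ? word.downcase.to_sym : :other
end

# Count how many lines fall into each category.
def tally_lines(lines)
  lines.each_with_object(Hash.new(0)) do |line, counts|
    counts[classify(line)] += 1
  end
end

sample = [
  "set number",
  "",
  "\" a comment",
  "nnoremap j gj",
  "set tabstop=4",
  "let mapleader = ','"
]
tally_lines(sample).each { |category, count| puts "#{category}: #{count}" }
```

Abbreviated and full command names (for example au and autocmd) land in separate buckets here, which matches how the table reports them.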

Selected Settings Distributions

Charts of value distributions for: history, tabstop, scrolloff, codewidth, guifont.

Color Findings

Summary
  Number of Records: 287
  Number of Unique Schemes: 81
  Dark : Light: 242 : 45

Top 5 Color Schemes
  Solarized: 69
  Desert: 22
  Molokai: 18
  IR_black: 15
  Railscasts: 14

Distribution

Boolean Setting Rates

This table shows how many of the 777 files had each setting turned on. (Full results are hidden in the DOM.)