ndjson - Wicked-Fast Streaming 'JSON' ('ndjson') Reader
Streaming 'JSON' ('ndjson') has one 'JSON' record per-line and many modern 'ndjson' files contain large numbers of records. These constructs may not be columnar in nature, but it is often useful to read in these files and "flatten" the structure out to enable working with the data in an R 'data.frame'-like context. Functions are provided that make it possible to read in plain 'ndjson' files or compressed ('gz') 'ndjson' files and either validate the format of the records or create "flat" 'data.table' structures from them.
Last updated
jsonndjsonr-cyberzlibcpp
8.04 score 56 stars 9 dependents 136 scripts 3.6k downloadsstatebins - Create United States Uniform Cartogram Heatmaps
The cartogram heatmaps generated by the included methods are an alternative to choropleth maps for the United States and are based on work by the Washington Post graphics department in their report on "The states most threatened by trade" (<http://www.washingtonpost.com/wp-srv/special/business/states-most-threatened-by-trade/>). "State bins" preserve as much of the geographic placement of the states as possible but have the look and feel of a traditional heatmap. Functions are provided that allow for use of a binned, discrete scale, a continuous scale or manually specified colors depending on what is needed for the underlying data.
Last updated
cartogramchoroplethchoropleth-mapdata-visualizationdatavisualizationstatebins
7.57 score 125 stars 2 dependents 333 scripts 751 downloadsvegalite - Tools to Encode Visualizations with the 'Grammar of Graphics'-Like 'Vega-Lite' 'Spec'
The 'Vega-Lite' 'JavaScript' framework provides a higher-level grammar for visual analysis, akin to 'ggplot' or 'Tableau', that generates complete 'Vega' specifications. Functions exist which enable building a valid 'spec' from scratch or importing a previously created 'spec' file. Functions also exist to export 'spec' files and to generate code which will enable plots to be embedded in properly configured web pages. The default behavior is to generate an 'htmlwidget'.
Last updated
data-visualizationdatavisualizationvega-litevega-lite-specvisualizationwidget
7.56 score 157 stars 77 scripts 323 downloadsdocxtractr - Extract Data Tables and Comments from 'Microsoft' 'Word' Documents
'Microsoft Word' 'docx' files provide an 'XML' structure that is fairly straightforward to navigate, especially when it applies to 'Word' tables and comments. Tools are provided to determine table count/structure, comment count and also to extract/clean tables and comments from 'Microsoft Word' 'docx' documents. There is also nascent support for '.doc' files.
Last updated
docxextract-tablesmicrosoft-wordtable-extraction
7.20 score 178 stars 222 scripts 1.1k downloadshrbrthemes - Additional Themes, Theme Components and Utilities for 'ggplot2'
A compilation of extra 'ggplot2' themes, scales and utilities, including a spell check function for plot label fields and an overall emphasis on typography. A copy of the 'Google' font 'Roboto Condensed' is also included.
Last updated
6.65 score 1 dependents 18k scripts 14k downloadssergeant - Tools to Transform and Query Data with Apache Drill
Apache Drill is a low-latency distributed query engine designed to enable data exploration and analysis on both relational and non-relational data stores, scaling to petabytes of data. Methods are provided that enable working with Apache Drill instances via the REST API, DBI methods and using 'dplyr'/'dbplyr' idioms. Helper functions are included to facilitate using official Drill Docker images/containers.
Last updated
apache-drilldplyrdrillparquet-filesr-cybersql
6.31 score 125 stars 65 scripts 271 downloadsqrencoder - Quick Response Code (QR Code) / Matrix Barcode Creator
Quick Response codes (QR codes) are a type of matrix bar code and can be used to authenticate transactions, provide access to multi-factor authentication services and enable general data transfer in an image. QR codes use four standardized encoding modes (numeric, alphanumeric, byte/binary, and kanji) to efficiently store data. Matrix barcode generation is performed efficiently in C via the included 'libqrencoder' library created by Kentaro Fukuchi.
Last updated
qrcodeqrcode-generatorcpp
6.06 score 62 stars 1 dependents 62 scripts 324 downloadsdarksky - Tools to Work with the 'Dark Sky' 'API'
Provides programmatic access to the 'Dark Sky' 'API' <https://darksky.net/dev/docs>, which provides current or historical global weather conditions.
Last updated
darkskydarksky-apidarksky-api-powereddarksky-weather-apidarkskyapiweatherkit
5.46 score 81 stars 36 scripts 625 downloadsepidata - Tools to Retrieve Economic Policy Institute Data Library Extracts
The Economic Policy Institute (<http://www.epi.org/>) provides researchers, media, and the public with easily accessible, up-to-date, and comprehensive historical data on the American labor force. It is compiled from Economic Policy Institute analysis of government data sources. Use it to research wages, inequality, and other economic indicators over time and among demographic groups. Data is usually updated monthly.
Last updated
5.45 score 20 stars 28 scripts 253 downloadsvoteogram - United States House and Senate Voting Cartogram Generators
'ProPublica' <https://projects.propublica.org/represent/> makes United States Congress member votes available and has developed their own unique cartogram to visually represent this data. Tools are provided to retrieve voting data, prepare voting data for plotting with 'ggplot2', create vote cartograms and theme them.
Last updated
data-visualisationdata-visualizationdatavisualization
5.44 score 42 stars 13 scripts 194 downloadswand - Retrieve Magic Attributes from Files and Directories
MIME types are shorthand descriptors for file contents and can be determined from "magic" bytes in file headers, file contents or intuited from file extensions. Tools are provided to perform curated "magic" tests as well as mapping MIME types from a database of over 1,800 extension mappings.
Last updated
filemagic-bytesr-cyber
4.97 score 19 stars 3 dependents 11 scripts 226 downloadslongurl - Expand Short 'URLs'
Tools are provided to expand vectors of short URLs into long 'URLs'. No 'API' services are used, which may mean that this operates more slowly than 'API' services do (since they usually cache results of expansions that every user of the service requests). You can setup your own caching layer with the 'memoise' package if you wish to have a speedup during single sessions or add larger dependencies, such as 'Redis', to gain a longer-term performance boost at the expense of added complexity.
Last updated
r-cyberurlurl-shortener
4.43 score 32 stars 17 scripts 253 downloadscymruservices - Query 'Team Cymru' 'IP' Address, Autonomous System Number ('ASN'), Border Gateway Protocol ('BGP'), Bogon and 'Malware' Hash Data Services
A toolkit for querying 'Team Cymru' <http://team-cymru.org> 'IP' address, Autonomous System Number ('ASN'), Border Gateway Protocol ('BGP'), Bogon and 'Malware' Hash Data Services.
Last updated
bogonsmalware-hash-registryr-cyberteam-cymru-webservice
3.62 score 6 stars 14 scripts 168 downloads