<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Big Data on Amit Kohli</title><link>https://www.amitkohli.com/tags/big-data/</link><description>Recent content in Big Data on Amit Kohli</description><generator>Hugo -- gohugo.io</generator><language>en</language><copyright>© 2026 Amit Kohli</copyright><lastBuildDate>Tue, 27 Sep 2016 03:53:07 +0000</lastBuildDate><atom:link href="https://www.amitkohli.com/tags/big-data/index.xml" rel="self" type="application/rss+xml"/><item><title>Heart-shaped wordcloud, celebrating Colombia peace treaty</title><link>https://www.amitkohli.com/heart-shaped-wordcloud-celebrating-colombia-peace-treaty/</link><pubDate>Tue, 27 Sep 2016 03:53:07 +0000</pubDate><guid>https://www.amitkohli.com/heart-shaped-wordcloud-celebrating-colombia-peace-treaty/</guid><description>&lt;p&gt;This is a lightening quick post just providing the script to draw a heart-shaped wordcloud, using the awesome &lt;a href="https://github.com/lchiffon/wordcloud2" target="_blank" rel="noreferrer"&gt;This is a lightening quick post just providing the script to draw a heart-shaped wordcloud, using the awesome&lt;/a&gt; package. See the resulting image here:&lt;/p&gt;</description></item><item><title>R Tagosphere!</title><link>https://www.amitkohli.com/r-tagosphere/</link><pubDate>Sun, 31 Jan 2016 17:41:10 +0000</pubDate><guid>https://www.amitkohli.com/r-tagosphere/</guid><description>&lt;p&gt;This post explores the inter-relationships of StackOverflow Tags for R-related questions. So I grabbed all the questions tagged with &amp;ldquo;r&amp;rdquo;, took the other topics: in each question and made some network charts that show how often each tag is seen with the other topics:. The point is to see the empirical relationships that develop as people organically describe their problems with R. &lt;a href="https://github.com/datastrategist/StackOverflow-tag-Network-R" target="_blank" rel="noreferrer"&gt;Full analysis on GitHub&lt;/a&gt;, as always.&lt;/p&gt;</description></item><item><title>Clickable list of the best animations since 1900, gathered the geek way.</title><link>https://www.amitkohli.com/list-of-the-best-animations-since-1900-gathered-the-geek-way/</link><pubDate>Thu, 22 Oct 2015 13:51:20 +0000</pubDate><guid>https://www.amitkohli.com/list-of-the-best-animations-since-1900-gathered-the-geek-way/</guid><description>&lt;p&gt;&lt;a href="https://i0.wp.com/amitkohli.com/wp-content/uploads/2015/10/images.jpg" target="_blank" rel="noreferrer"&gt;&lt;img class="alignnone wp-image-420" src="https://i0.wp.com/amitkohli.com/wp-content/uploads/2015/10/images.jpg?resize=138%2C106" alt="images" width="138" height="106" srcset="https://i0.wp.com/amitkohli.com/wp-content/uploads/2015/10/images.jpg?zoom=2&amp;resize=138%2C106 276w, https://i0.wp.com/amitkohli.com/wp-content/uploads/2015/10/images.jpg?zoom=3&amp;resize=138%2C106 414w" sizes="(max-width: 138px) 100vw, 138px" data-recalc-dims="1" /&gt;&lt;/a&gt;&lt;a href="https://i2.wp.com/amitkohli.com/wp-content/uploads/2015/10/4.jpg" target="_blank" rel="noreferrer"&gt;&lt;img class="alignnone wp-image-423" src="https://i2.wp.com/amitkohli.com/wp-content/uploads/2015/10/2.jpg?resize=137%2C105" alt="2" width="137" height="105" srcset="https://i2.wp.com/amitkohli.com/wp-content/uploads/2015/10/2.jpg?zoom=2&amp;resize=137%2C105 274w, https://i2.wp.com/amitkohli.com/wp-content/uploads/2015/10/2.jpg?zoom=3&amp;resize=137%2C105 411w" sizes="(max-width: 137px) 100vw, 137px" data-recalc-dims="1" /&gt;&lt;img class="alignnone wp-image-422" src="https://i0.wp.com/amitkohli.com/wp-content/uploads/2015/10/3.jpg?resize=139%2C104" alt="3" width="139" height="104" srcset="https://i0.wp.com/amitkohli.com/wp-content/uploads/2015/10/3.jpg?zoom=2&amp;resize=139%2C104 278w, https://i0.wp.com/amitkohli.com/wp-content/uploads/2015/10/3.jpg?zoom=3&amp;resize=139%2C104 417w" sizes="(max-width: 139px) 100vw, 139px" data-recalc-dims="1" /&gt;&lt;img class="alignnone wp-image-421" src="https://i2.wp.com/amitkohli.com/wp-content/uploads/2015/10/4.jpg?resize=158%2C105" alt="4" width="158" height="105" data-recalc-dims="1" /&gt;&lt;/a&gt;&lt;/p&gt;</description></item><item><title>Locations for 75000 dams</title><link>https://www.amitkohli.com/locations-for-75000-dams/</link><pubDate>Tue, 20 Oct 2015 20:30:39 +0000</pubDate><guid>https://www.amitkohli.com/locations-for-75000-dams/</guid><description>&lt;p&gt;The last task I performed for &lt;a href="http://www.fao.org/nr/AQUASTAT" target="_blank"&gt;[[AQUASTAT]]&lt;/a&gt; was to try to find the best way to estimate the anthropogenic evaporation from dams. The paper can be found &lt;a href ="http://www.fao.org/3/bc814e/bc814e.pdf"&gt;here&lt;/a&gt;, but here I provide one of the fun outputs, a map of 75000 dams!&lt;/p&gt;</description></item><item><title>Transboundary surface water flow</title><link>https://www.amitkohli.com/transboundary-surface-water-flow/</link><pubDate>Wed, 04 Mar 2015 19:05:38 +0000</pubDate><guid>https://www.amitkohli.com/transboundary-surface-water-flow/</guid><description>&lt;p&gt;A [[Visualization]] generated for [[AQUASTAT]] of [[FAO]].&lt;/p&gt;
&lt;p&gt;&lt;a href="https://i2.wp.com/amitkohli.com/wp-content/uploads/2015/03/trans.sankey.png" target="_blank" rel="noreferrer"&gt;&lt;img class="alignnone size-medium wp-image-299" src="https://i2.wp.com/amitkohli.com/wp-content/uploads/2015/03/trans.sankey.png?resize=300%2C132" alt="trans.sankey" width="300" height="132" srcset="https://i2.wp.com/amitkohli.com/wp-content/uploads/2015/03/trans.sankey.png?resize=300%2C132 300w, https://i2.wp.com/amitkohli.com/wp-content/uploads/2015/03/trans.sankey.png?resize=700%2C309 700w, https://i2.wp.com/amitkohli.com/wp-content/uploads/2015/03/trans.sankey.png?w=794 794w" sizes="(max-width: 300px) 100vw, 300px" data-recalc-dims="1" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Two types of charts were prepared: Sankey and Network. The Sankey plots allow for an ‘automatic sorting&amp;rsquo; of countries based on who is the water tower and who is the water source. This [[Visualization]] is useful to demonstrate where a country falls on this continuum.&lt;/p&gt;</description></item><item><title>Chord progressions of 5000 songs!</title><link>https://www.amitkohli.com/chord-progressions-of-5-000-songs/</link><pubDate>Sun, 01 Mar 2015 00:00:00 +0000</pubDate><guid>https://www.amitkohli.com/chord-progressions-of-5-000-songs/</guid><description>&lt;p&gt;Update: Full analysis and everything you need at my github &lt;a href="https://github.com/datastrategist/Musical-chord-progressions" target="_blank" rel="noreferrer"&gt;https://github.com/datastrategist/Musical-chord-progressions&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;The &lt;a href="http://www.hooktheory.com/trends" target="_blank"&gt;Hooktheory.com&lt;/a&gt; database contains analyses of over 5000 songs*. These analyses are uploaded by users and allow for all these songs to be analyzed in bulk, as well as individually. One of these ‘all song&amp;rsquo; analyses enables users to gather chord progressions on ALL songs (see the analysis file to see how i did it, using the hooktheory API and R). This allowed us to  create a Sankey [[Visualization]] of all chord progressions in the Hooktheory database.&lt;/p&gt;</description></item></channel></rss>