Publications by Sharp Sight

How to map geospatial data: USA rivers

07.02.2017

R Code Here’s the R code to produce the map: #=============== # LOAD PACKAGES #=============== library(tidyverse) library(maptools) #=============== # GET RIVER DATA #=============== #========== # LOAD DATA #========== #DEFINE URL # - this is the location of the file url.river_data <- url("http://sharpsightlabs.com/wp-content/da...

4890 sym R (3898 sym/5 pcs) 10 img

One thing critical for success in the age of AI (and who has it)

14.02.2017

Data science, artificial intelligence, automation, and other advanced technologies are reshaping the world. We’re starting to see glimpses of it, for example the rapid emergence of self driving cars. Moreover, the changes brought about by technology will likely become more frequent and more dramatic in the next few years. In fact, two MIT econ...

3678 sym R (5720 sym/1 pcs) 2 img

How to make a global map in R, step by step

21.02.2017

In the last several blog posts at Sharp Sight, I’ve created several different maps. Maps are great for practicing data visualization. First of all, there’s a lot of data available on places like Wikipedia that you can map. Moreover, creating maps typically requires several essential skills in combination. Specifically, you commonly need t...

11825 sym R (5689 sym/16 pcs) 6 img

How to annotate a plot in ggplot2

28.02.2017

“Master the basics.” That’s a common mantra here at Sharp Sight. Loyal readers know what I mean by “master the basics.” To master data science, you need to master the foundational tools. That means knowing how to create essential plots like: Bar charts Line charts Histograms And performing data manipulations like: Creating new variab...

5719 sym R (450 sym/2 pcs) 6 img

Mapping “France at night” with the new sf package

07.03.2017

A few days ago, I was inspired by a set of photographs of Earth from space, at night. The images are amazing, so I decided to try to replicate them using R. To be a little more specific, I found a single dataset – a data set with French population data – and I’m using it to replicate the images for a single country: France. A few notes and ...

5546 sym R (4873 sym/9 pcs) 8 img

How to choose a project to practice data science

14.03.2017

Here at Sharp Sight, I’ve derided the “jump in and build something” method of learning data science for quite some time. Learning data science by “jumping in” and starting a big project is highly inefficient. However, projects can be extremely useful for practicing data science and refining your skillset, if you know how to select the ...

17738 sym 2 img

The one thing you need to master data science

21.03.2017

When you ask people what makes a person great – what makes someone an elite performer – they commonly say “talent.” Most people believe that elite performers are born with their talent. Most people believe that top performers come into the world with an innate talent that makes them special. You see something like this in data science...

18584 sym R (68 sym/1 pcs) 2 img

Are you fluent in R?

30.03.2017

A few weeks ago, I wrote an article saying that you should master R. The basic argument, is that if you want to actually work as a data scientist, you need to know the essential tools backwards and forwards. In response, a reader left a comment. I have to say that it’s unfortunate to read something like this, but sadly it’s very common. ...

11560 sym 2 img

Simple practice: basic maps with the Tidyverse

15.08.2017

To master data science, you need to practice. This sounds easy enough, but in reality, many people have no idea how to practice. A full explanation of how to practice is beyond the scope of this blog post, but I can give you a quick tip here: You need to master the most important techniques, and then practice those techniques in small scripts unt...

4379 sym R (1087 sym/6 pcs) 14 img

Simple practice: data wrangling the iris dataset

23.08.2017

In last weeks post, I emphasized the importance of practicing R and the Tidyverse with small, simple problems, drilling them until you are competent. In that post, I gave you a few very small scripts to practice (which I suggest that you memorize). This week, I want to give you another small example. We’re going to clean up the iris dataset. M...

5173 sym R (787 sym/1 pcs)