Because it’s Friday: Infrastructure Collapses

Read More

Seeing Theory: Learn Statistics through simulation

Read More

The Most Popular Languages for Data Scientists/Engineers

Read More

One of the major announcements from yesterday’s Data Amp event was that SQL Server 2017 will add Python as a supported language. Just as with the continued R support, SQL Server 2017 will allow you to process data in the database using any Python function or package without needing to export the data from the …

0 6.5k

Warren Buffett — known as the “Oracle of Omaha” — is one of the most successful investors of all time. Wherever the winds of the market may blow, he always seems to find a way to deliver impressive returns for his investors and his company, Berkshire Hathaway. Every year he authors his famous “shareholder letter” with his …

0 6.5k

There’s an ongoing debate in the academic community about whether Calculus is a necessary pre-requisite for teaching Statistics. But in age of ubiquitous computing resources (not to mention open source programming languages like R), there’s a fair argument to be made that all you really need is simulation. However complex the statistical proposition, you can always find …

0 6.5k

This coming Wednesday, April 19 at 8AM Pacific Time (click for your local time), Microsoft will be hosting a major on-line event of interest to anyone working with big data, analytics, and artificial intelligence: Microsoft Data Amp. During Data Amp, Executive Vice President Scott Guthrie and Corporate Vice President Joseph Sirosh will share how Microsoft’s …

0 6.5k

As anyone who’s tried to analyze real-world data knows, there are any number of problems that may be lurking in the data that can prevent you from being able to fit a useful predictive model: Categorical variables can include infrequently-used levels, which will cause problems if sampling leaves them unrepresented in the training set. Numerical …

0 6.5k

Maëlle Salmon recently created a collage of profile pictures of people who use the #rstats hashtag in their Twitter bio to indicate their use of R. (I’ve included a detail below; click to see the complete version at Maëlle’s blog.) Naturally, Maëlle created the collage using R itself. Matching Twitter bios were found using the search_users function in the …

0 6.5k

Reddit conducted an interesting social experiment last weekend. It provided all of its users with a blank canvas, and the ability to color its pixels according to just three simple rules: You can… … Source link

0 6.5k

by Srini Kumar, VP of Product Management and Data Science, LevaData; and Bob Horton, Senior Data Scientist, Microsoft A rational function is defined as the ratio of two functions. The Padé Approximant uses a ratio of polynomials to approximate functions: $$ R(x)= frac{sum_{j=0}^m a_j x^j}{1+sum_{k=1}^n b_k x^k}=frac{a_0+a_1x+a_2x^2+cdots+a_mx^m}{1+b_1 x+b_2x^2+cdots+b_nx^n} $$ Here we show a way to …

0 6.5k

The results of the 2017 StackOverflow Survey of nearly 65,000 developers were published recently, and includes lots of interesting insights about their work, lives and preferences. The results include a cross-tabulation of the most popular languages amongst the “Data Scientist/Engineer” subset, and the results were … well, surprising: When thinking about data scientists, it certainly makes …

0 6.5k

At the recent Strata conference in San Jose, several members of the Microsoft Data Science team presented the tutorial Using R for Scalable Data Analytics: Single Machines to Spark Clusters. The materials are all available online, including the presentation slides and hands-on R scripts. You can follow along with the materials at home, using the Data Science …

0 6.5k