Posts archived in infos

0 comments

Book shoppin’…

I honestly have no book on R programming. In fact I have not a single book on programming at all (my coding proves that ;x). I am pretty sure that I am gonna order (just did!) that book.

You can get a look of Matloff’s text here (= pdf for ya)

0 comments

Data is everywhere!

I was writing earlier today that I am getting really fed to using the same datasets over and over again. Of course using the same data over time with different methods (eg look this) serves really well on a comparison scope but still we can use other data in a web world. For example, you can get interesting sets from BuzzData (or similar services).

Η φωτογραφία προφίλ του χρήστη Manos Parzakonis  -  12:31 μ.μ.  -  Δημόσιο

Plz don’t make me type

data(BostonHousing)

ever again. I hate this dataset ;x
#rstats#rant

PS: Of course you can add me to your Google+ circles…

0 comments

Data pains…

It’s official that discrete data and contigency tables can be the greatest of all the data analyzing pains!

Fact checked…;(

0 comments

Blogs to visit…

Peter Skomoroch came up with a list of blogs with “data (analysis)” as their core subject. There you go…I post the first few, you can follow th link to the page of the original post to get more. Btw, I must get my blogroll/links list ready at some point….

[source]

I found time and read Gelman and Hill’s “Data Analysis Using Regression and Multilevel / Hierarchical Models“…Now, please do yourself a favour and get it (of course the paperback version ;) ). Even for experienced or intermediate (myself) this will be a treat for your eyes and neurons.

PS : (Confession) I didn’t like the Bayesian Data Analysis book.
0 comments

Who’s joking?

A little joke from Gary Ramseyer’s collection of statistical jokes

A one-way anova shouted at a two-way anova: “stop! Turn around – you are going the wrong way!”
The two-way anova yelled back: “sorry! I will turn when i see an interaction!”

[source]

From the book website :

IPSUR stands for Introduction to Probability and Statistics Using R,
ISBN: 978-0-557-24979-4, which is a textbook written for an
undergraduate course in probability and statistics. The approximate
prerequisites are two or three semesters of calculus and some linear
algebra in a few places. Attendees of the class include mathematics,
engineering, and computer science majors.

Now, there is a new way to read R books (anyway, new to me!)

1
2
3
install.packages("IPSUR", repos="http://cran.r-project.org")
library(IPSUR)
read(IPSUR)

I was flicking thru some papers I have printed in the last years and definetely Breiman’s is one of my favorite. A pragmatic and insightful reading for a new statistician (or data analyst if you prefer;)).

As I left consulting to go back to the university,these were the perceptions I had about working with data to find answers to problems:

(a) Focus on finding a good solution—that’s what consultants get paid for.
(b) Live with the data before you plunge into modeling.
(c) Search for a model that gives a good solution, either algorithmic or data.
(d) Predictive accuracy on test sets is the criterion for how good the model is.
(e) Computers are an indispensable partner.

Read more

Leo Breiman (2001), Statistical modeling: The two cultures, Statistical Science, 16:199-231 [pdf]

is maps and geographical data representation in R.

In case you’re curious too this is a good study material from R-Bloggers :

maps ; geographical ; spatial

Ok. This could be a tweet rather than a post…

Computational Finance and Financial Engineering
1st R/Rmetrics Summer School and 4th User/Developer Meeting
Meielisalp, Lake Thune Switzerland, June 27 – July 1, 2010
www.rmetrics.org
Read the rest of this entry »

Real Time Web Analytics