What a nice Christmas surprise!

‘photobiology’ is one of the nine R packages highlighted by METACRAN today as trending this week.

What a nice Christmas surprise!

‘photobiology’ is one of the nine R packages highlighted by METACRAN today as trending this week.

Leading scientific journal Nature reported on Wednesday about a maximum lifespan for humans. But are their statistics right?

Source: *Nature article is wrong about 115 year limit on human lifespan – NRC*

A small set observations with a few extreme observations plus subjective splitting of a data set into two subsets to be fitted separately to a linear regression model resulted in very clear cut conclusions and striking figures. However, none of this is solid evidence, or evidence at all supporting the paper’s conclusions. This series of articles, not only discusses the problems in the paper, but more importantly, it traces the review process that allowed it to be published in Nature.

A new analysis of the data appears in an article at “Ask a Swiss” but still based on model fitting. They detect a significant change in slope, but still we do not have confidence bands available.

A new paper uses flawed methods to predict likely criminals based on their facial features.

Source: *Put Away Your Machine Learning Hammer, Criminality Is Not A Nail*

This is a very clear-cut example of a case of not being aware of the limitations of the data. The same problem affects the interpretation of data from any survey or experiment and any statistical method, machine learning to linear regression or t-tests. The design of the survey or experiment limits the range of validity of possible conclusions and the nature of such conclusions. No sophisticated analysis can correct for biased data unless the bias is measured. We cannot extract information that is not already in the data. We need to make assumptions, but this imposes limitations to what can be concluded. Less dangerous, but equally wrong interpretations of data analyses are a lot more frequent in the scientific literature than we usually realize. Look for parallels between this example and your own research or what you read. What are the data representative of? and meaningful for?

Taiteen väitöskirja jäi Markus Rissaselta syrjään, kun laatat vaativat kokopäiväisen huomion.

A very interesting example of a long unsolved math/geometry problem solved by a PhD student in arts, using ingenuity, lots of paper, pencils and a calculator.