Applications Google
Menu principal

Post a Comment On: Backreaction

"Can we measure scientific success? Should we?"

20 Comments -

1 – 20 of 20
Blogger Captain InterStellar said...

Hi Sabine,

Interesting post. I think there can a problem with measuring scientific success if one goes by the number of papers published, quality is not measured by quantity. Let's say an institution published 100 papers which can't be verified by experiment, should this institution get tax payer funding? What benefit is there to Science if none of the theoretical work can be verified?

I would think that scientific success is measured by how much real positive impact the research has had or can have to our understanding of the Universe or how much real life changing benefits it has brought or can bring to people or how we do things. If the theoretical research is of high quality but can't be verified by experiment because it's a technology issue then it needs to be shelved and it's scientific success later determined when the time is right.


Cheers, Paul.

11:07 AM, August 25, 2013

Blogger Uncle Al said...

Hectares of scholarly analysis paper the Sistine Chapel ceiling. Sepia light and shadow, characters' allegorical political intrigues are exhaustively explored. Soot removal exposed comic book-bright pigmentation. Global gender misassignments hid under post-painting censorship. The analyses are wrong.

Physics observes vacuum mirror-symmetry toward photons. Theory assumes vacuum mirror-symmetry toward matter. Theory suffers parity violations and chiral anomalies unendingly patched with symmetry breakings. An axiomatic system must be externally falsified, even if incomplete (Euclid vs. Bolyai) or unphysical (Newton vs. relativity and quantum mechanics). Theory that contradicts observation is wrong.

Opposite shoes violate the Equivalence Principle. Five distinct classes of experiment are exquisitely sensitive to trace vacuum chiral anisotropy toward matter, where physics never looked. A microwave chiral rotational spectrum experiment is one day. String/M-theory, SUSY, and dark matter cannot endure a tenth part-per-trillion divergence. How does one rate epicycles? Workmanship?

11:17 AM, August 25, 2013

Comment deleted

This comment has been removed by the author.

11:37 AM, August 25, 2013

Blogger Phil Warnell said...

Hi Bee,

There is a lot I find good with what you say here with which I agree. However I think one of the things you seem to be skirting around, which is what also forms to be the most difficult (if not impossible) of all things to have distinguished by way of measure, being what has people to become scientists to begin with. I would say that the scrutiny of one’s work on its own, especially when a scientist is just beginning, is a completely unreliable measure, and thus why what it is that has someone brought to science should be given greater weight than it currently is. That is I think the character of the researcher to be important, as when that is found to be right it speaks as well to the issues with which many of the metrics are attempting to have measured and yet found wanting in being able to. Perhaps interestingly in such regard Einstein did have something to say, and yet he too was left somewhat baffled as to be able to define exactly what it was or imagine how it could be quantified and yet still found he recognized it whenever it was present.


” In the temple of science are many mansions, and various indeed are they that dwell therein and the motives that have led them thither. Many take to science out of a joyful sense of superior intellectual power; science is their own special sport to which they look for vivid experience and the satisfaction of ambition; many others are to be found in the temple who have offered the products of their brains on this altar for purely utilitarian purposes. Were an angel of the Lord to come and drive all the people belonging to these two categories out of the temple, the assemblage would be seriously depleted, but there would still be some men, of both present and past times, left inside. Our Planck is one of them, and that is why we love him.

I am quite aware that we have just now lightheartedly expelled in imagination many excellent men who are largely, perhaps chiefly, responsible for the buildings of the temple of science; and in many cases our angel would find it a pretty ticklish job to decide. But of one thing I feel sure: if the types we have just expelled were the only types there were, the temple would never have come to be, any more than a forest can grow which consists of nothing but creepers. For these people any sphere of human activity will do, if it comes to a point; whether they become engineers, officers, tradesmen, or scientists depends on circumstances.

Now let us have another look at those who have found favor with the angel. Most of them are somewhat odd, uncommunicative, solitary fellows, really less like each other, in spite of these common characteristics, than the hosts of the rejected. What has brought them to the temple? That is a difficult question and no single answer will cover it.”


-Albert Einstein, “Principles of Research”, Address at the Physical Society, Berlin, for Max Planck's 60th birthday (1918)


Regards,

Phil

11:43 AM, August 25, 2013

Blogger scimom said...

The most crucial observation, which is however not addressed in this blogpost, is that there is no evidence the measure matters.

I come from HEP, where I had a chance to meet hundreds of people, many of which I got to know well in person. I have never seen a case in which the hiring/grants system failed. Every person that I have ever deemed worthy of a job or of a grant indeed got it plus minus small fluctuations. Then there are people in the gray zone, for which there may be bigger upwards and downwards fluctuations, but it does not matter anyway since such people do not end up making a mark on science.

Therefore, I dont see why we need to optimize these measures or chase our tails with feedback loops.

I would go as far as to say that I have not witnessed real injustice even in the postdoc market, just small fluctuations here and there.

12:31 PM, August 25, 2013

Blogger Daniel Lemire said...

I'm with scimom. It is all good and well to ask for better and more thorough measures, and claim that this will make things better... but I also think that we need to assess these claims empirically.

Is it true that "better" measures allow you to recruit better scientists that you would otherwise?

8:28 AM, August 26, 2013

Blogger Peter Turney said...

"Is it true that "better" measures allow you to recruit better scientists that you would otherwise?"

How can you even answer that question, except by introducing another measure, and comparing the given measure to the new measure?

8:56 AM, August 26, 2013

Blogger B Yen/Getty Images [ iTunes demo ] said...

"I'm Locally Pessimistic, Globally Optimisitc"
-- Dr Jordan Pollack, Brandeis Univ Computer Science Prof, former grad-school colleague of mine (UIUC)

He made this statement during an interesting Slashdot.org interview. Current state of research-sector, is based on INCOMPLETE data-set leading to temporal theories. "Ill-conditioned problem" -- there are multiple theories that can explain the partial data-set. Successively accumulated data always asks "more questions", leading to modified theories (or new ones altogether)

How is it possible to "measure" progress, if one doesn't know the "global data-set"? The analogy of successively peeling onion layers is used to describe Science process, you can't figure out "where you are", by the current "layers of development". I suppose you can extrapolate given the previous "conquered layers"..but still there's a paradox.

"Predicting the Future..is PREDICTABLE UNPREDICTABLE"

Look at how any sector has developed, especially Technology. Some of the stuff was completely "out of the blue", serendipitous/random.

See next post for THREE Mass-Consumer market sectors, that came from US Military "seeding". NO WAY was that predictable. How could this "future development" been predicted back then (60's), or "progress measured"? I think the model is Evolution,

"It's not the Smartest, the Strongest, but the Species that is most ADAPTABLE TO CHANGE, that survives"
-- Charles Darwin

9:25 AM, August 26, 2013

Blogger B Yen/Getty Images [ iTunes demo ] said...

More interesting comments by Dr Pollack from Edge.org

http://edge.org/response-detail/10553

"A measurement of innovation rate.

There is no measure of the rate at which processes like art, evolution, companies, and computer programs innovate.

Consider a black box that takes in energy and produces bit-strings. The complexity of a bit-string is not simply its length, because a long string of all 1's or all 0's is quite simple. Kolmogorov measures complexity by the size of the smallest program listing that can generate a string, and Bennet's Logical Depth also accounts for the cost of running the program. But these fail on the Mandelbrot Set, a very beautiful set of patterns arising from a one-line program listing. What of life itself, the result of a simple non-equilibrium chemical process baking for quite a long time? Different algorithmic processes (including fractals, natural evolution, and the human mind) "create" by operating as a "Platonic Scoop," instantiating "ideals" into physical arrangements or memory states.

So to measure innovation rate (in POLLACKS) we divide the P=Product novelty (assigned by an observer with memory) by the L=program listing size and the C= Cost of runtime/space/energy.

Platonic Density = P / LC"

"Knowing is not enough, we must Apply"
"Willing is not enough, we must Do"
-- Bruce Lee

9:28 AM, August 26, 2013

Blogger Uncle Al said...

Fabricate fashionable rating paradigms. Hire only quantified safely qualified, countably productive ink pushers. 1935 Wallace Carothers discovered nylon, abominating proper knowledge. He also committed suicide. Ban nylon.

Eldritch talents are intellectual terrorists. The stuff that was coming out of him consisted of words, but it was not speech in the true sense: it was a noise uttered in unconsciousness, like the quacking of a duck..., 1984. Management says, "Quack, damn you."

12:03 PM, August 26, 2013

Blogger Arun said...

W. Edwards Deming, "Out of the Crisis", ~1985,

Measures of productivity are like statistics on accidents: they tell you all about the number of accidents in the home, on the road and at the workplace, but they do not tell you how to reduce the frequency of accidents.

...
Some leaders forget an important mathematical theorem that if 20 people are engaged on a job, 2 will fall at the bottom 10 per cent, no matter what. It is difficult to overthrow the law of gravitation and laws of nature. The important problem is not the bottom 10 per cent, but who is statistically out of line and in need of help.

...
...
No one can put in his best performance unless he feels secure.

1:50 PM, August 26, 2013

Blogger Phillip Helbig said...

I agree with most of your criticism of various measures, but at the end of the day you need to decide whom to allocate funding to. Giving it to everyone who says "I'm a scientist; give me money" won't work.

4:06 AM, September 02, 2013

Blogger Sabine Hossenfelder said...

Hi Phil,

I agree with you insofar as that almost all of these measures are useless if you're dealing with young people (where by 'young' I don't necessarily mean young by age, but new to the field). But while I agree that motives and motivation is relevant for success, I don't think it makes a good criterion. I've known a bunch of very motivated students who were just, well, not very good at physics. One could say, it's necessary but not sufficient, and since it's necessary it doesn't seem to make much sense 'measuring' it. Which is to say, essentially, if somebody is applying for a job it makes sense to assume that they want to work in the field.

Having said that, the motivation to work at any particular place is another thing and in fact something that is dealt with in unclear ways. For example, we get applications from basically all over the globe but, not so surprisingly, the ones with ties to North Europe are more likely to accept an offer. And this again touches on a much bigger problem which is the annual dance of offers and deadlines which, if you ask me, is a totally archaic procedure.

Best,

B.

5:08 AM, September 03, 2013

Blogger Sabine Hossenfelder said...

Hi Scimom,

I hear you but can't say the same is true for me. I know many people who I deem 'worthy' of a grant or a job and who didn't get those, and on the other hand I know a bunch of dull people who easily got these. For what I am concerned, there is clearly something rotten in the procedure. Note that I'm doing what the NSF calls 'transformative' research, which might explain the difference. Best,

B.

5:11 AM, September 03, 2013

Blogger Sabine Hossenfelder said...

Daniel,

I totally agree that these claims must be assessed empirically. The problem is though that the relevant people, those hiring firing and handing out grants, don't see the need to evaluate and possibly rethink their procedures.

I don't know if better measures would allow you to recruit 'better' scientists. What I've tried to say in my blogpost is that that's the wrong question to ask. We should ask what measures can do to make science run more efficiently, by saving researchers time and effort with what they are already doing anyway. I also think that using measures can prevent sloppiness that comes from the attempt of researchers to save time and effort. That's how I think using measures will help science. Best,

B.

5:16 AM, September 03, 2013

Blogger Phillip Helbig said...

"Smolin argued that the number of single authored papers is a good indicator for a young researcher’s promise. He’s not alone in this belief. Most young researchers are very aware that a single authored paper will put a sparkle on their publication list. But maybe a researcher with many single authored papers just a bad collaborator."

I would say that single-author papers from a young researcher are an indication of quality. However, the converse is not true. There are fields in which almost all papers must have multiple authors, and there are the "Lennon and McCartney types" who always have more than one name on the paper, for various reasons.

10:50 AM, September 03, 2013

Blogger Sabine Hossenfelder said...

Phillip,

Yes, that matches very well with my experience.

2:45 AM, September 04, 2013

Comment deleted

This comment has been removed by the author.

7:24 AM, September 04, 2013

Blogger Phil Warnell said...

Hi Bee,

I wasn’t contending one should place motive and or motivation as to being the overriding factor, more just to suggest with all else being equal (that is more or less), it should form to be what tips the scale. Moreover, I do (and did) agree that such cannot be accessed (well at least at present) by some now known recognized test or method, yet rather something which is considered as to be explored in the final stages of selection; with respect to personal interviews of the whittled down list of candidates and if possible with those who have lent them their personal recommendation. I also think this was the message that Einstein was attempting to have communicated, as recognizing science not simply as an occupation, but rather more so as being a calling, as it’s base formed as one being a philosophy. The contention being that it’s then best represented by those who find its purpose is found in serving to benefit humanity as a whole, rather than simply the individual, or the ambitions of the particular employer (institution) for that matter.

”Anybody who has been seriously engaged in scientific work of any kind realizes that over the entrance to the gates of the temple of science are written the words: Ye must have faith. It is a quality which the scientist cannot dispense with.”

-Max Planck, “Where Is Science Going?”

Regards,

Phil

7:31 AM, September 04, 2013

Blogger amused said...

Hi Bee, a few belated comments on this:

"Most young researchers are very aware that a single authored paper will put a sparkle on their publication list"

Having been such a young researcher I think it is more accurate to say:
"A few naive young researchers think that single authored papers will add sparkle to their publication list, but then after they publish the single authored paper(s) they discover that the reality is not what they expected..."

One of my favorite rhetorical questions to ask Sean Carroll and others of his ilk back in the good old days on Cosmicvariance was:
"In the competition for jobs, how many single-author publications in PRL on a non-string theory topic does it take to balance out a single publication in, say, PRD by a young person as junior collaborator of a senior prominent string theorist and with a bunch of other coauthors on the paper?
(Answer: no number of single-author PRLs will be sufficient.)

Most young researchers are clued on enough to instinctively realize that it will be much better for their careers to be working as junior collaborator and coauthor with prominent senior researchers rather than writing papers on their own. So they don't bother to write singe-author papers, even though some of them probably could if they wanted to.

Regarding measures in general, one thing i think it is important to recognize is that you can get pretty much any outcome you want by choosing the measures in a suitable way. In fact I have the impression that quite often decision makers informally decide the outcome they want (based on their gut feelings mostly) and then chose a collection of measures that produces the desired outcome. It would be more efficient to just put them all together in a room and tell them to make the decision based on their collective gut instinct without the bother of having to cook up measures to formally justify it.

Another thing i think it's important to recognize is that when you or me or anyone else makes proposals for how to measure performance it is inevitable that we are going to favor measures that make us look good personally.
For example, I am convinced deep down in my gut that number of single-author publications in PRL should count for more than anything else when assessing theoretical physicists, especially junior ones. That also just happens to be the measure that would favor me most compared to my peers, but i'm sure that's just a coincidence ;-)
On the other hand, I shuddered when i read the measures you were proposing in the post because i know that I would score terribly on them. But maybe you would score very well on them? ;-)

6:08 AM, October 06, 2013

You can use some HTML tags, such as <b>, <i>, <a>

Comment moderation has been enabled. All comments must be approved by the blog author.

You will be asked to sign in after submitting your comment.
OpenID LiveJournal WordPress TypePad AOL