Category Archives: The Scientific Process

All Higgs All the Time(s)

As many of you will have already noticed, today’s Science Times section of the New York Times newspaper is devoted to articles by Dennis Overbye on the search for the Higgs particle.  At first read, the articles seem pretty good; several key players are interviewed (though inevitably, given page constraints, a number of important players in the experiments are not mentioned) and the science seems mostly accurate, with a few small errors, omissions, or misleading ways of saying things in the glossary and elsewhere.  I’m busy preparing a new public talk for tomorrow, so I’ll have to reserve any detailed comments for later in the week.

But one thing you will notice, if you read the long article which describes the ins and outs of the search process, is that several of the responsible scientists quoted indicate, directly or indirectly, that the December 2011 data did not convince them that a Higgs particle had yet been found.  That was the position I took on this blog, and I reported to you that most responsible scientists I had spoken to (which didn’t happen to include any of the ones quoted in the Science Times today) viewed the December data as inconclusive — meaning that it was still quite possible that the apparent signal of a Higgs particle might evaporate.  Almost every other major particle physics blogger disagreed with me, both on my opinion and on my characterization of others’ opinions.  But I stand by my statements: that though the data reported in July 2012 was essentially definitive, the data in December 2011 was, not only from my perspective but from that of many serious scientists, suggestive yet inconclusive.  And you can now read that in the New York Times.

Wednesday: Sean Carroll & I Interviewed by Alan Boyle

On Wednesday February 6th, at 9 pm Eastern/6 pm Pacific time, Sean Carroll and I will be interviewed by Alan Boyle on “Virtually Speaking Science”.   The link where you can listen in (in real time or at your leisure) is http://www.blogtalkradio.com/virtually-speaking-science/2013/02/07/sean-carroll-matt-strassler-alan-boyle

What is “Virtually Speaking Science“?  It is an online radio program that presents, according to its website:

  • Informal conversations hosted by science writers Alan Boyle, Tom Levenson and Jennifer Ouellette, who explore the explore the often-volatile landscape of science, politics and policy, the history and economics of science, science deniers and its relationship to democracy, and the role of women in the sciences.

Sean Carroll is a Caltech physicist, astrophysicist, writer and speaker, one of the founders of the blog Cosmic Variance, who recently completed an excellent popular book (which I highly recommend) on the Higgs particle, entitled “The Particle at the End of the Universe“.  Our interviewer Alan Boyle is a noted science writer, author of the book “The Case for Pluto“, winner of many awards, and currently NBC News Digital’s science editor [at the blog  "Cosmic Log"].

I was interviewed on Virtually Speaking Science once before, by Tom Levenson, about the Large Hadron Collider (here’s the link).  Also, my public talk “The Quest for the Higgs Particle” is posted in their website (here’s the link to the audio and to the slides).

 

Solution to Yesterday’s Puzzle: Higgs, Found!

Of the 20 plots shown in yesterrday's post, the five marked in red have no Higgs signal, and contain nothing but statistical fluctuations on a background that, with infinite statistics, would give a flat horizontal histogram.

Fig. 1: Of the 20 plots shown in yesterday’s post, the five marked in red have no Higgs signal, and contain nothing but statistical fluctuations on a background that, with infinite statistics, would give a flat horizontal histogram.

Well, I hope some of you attempted yesterday’s exercise, which involved looking at a bunch of plots with simulated data, and trying to figure out in each plot

  • is there a signal of the “Higgs particle” in the plot?
  • what roughly is the mass of the “Higgs particle” (which I assured you lies, for each plot, in the range 122-127 GeV/c²?)

So now let’s see how well you did, and also what the implications are for the 3 GeV/c² discrepancy between the two measurements by the ATLAS experiment of the Higgs particle’s mass — a discrepancy which led to a big discussion (which ATLAS did not encourage or generate, mind you).  Some speculated (and others chose to report this speculation in the news media)  that there might be evidence in this data for two Higgs particles. (This is contradicted by CMS’s result from November, but hey, who’s counting?)  But the substantive question you might ask is:  Is this such a big discrepancy that either there are two Higgs particles or ATLAS has made a big mistake?

With that in the back of our minds, let’s take a look at the exercise I proposed yesterday, and see how things go.  Note, however, that this exercise does not have direct or precise implications for ATLAS’s discrepancy.  First, these plots are made in crude fashion and without simulating real Higgs particles, though the signal and the background are about the right size to match ATLAS’s current data on a Higgs decaying to two lepton/anti-lepton pairs.  Second, ATLAS uses very sophisticated methods to measure the Higgs mass, much more powerful than you could employ by eye, and much more than these graphs could convey.  Rather, the point here is to convey limitations of human psychology, and illustrate that humans are not naturally skilled at evaluating the statistical properties of small amounts of data.

Which Plots Have a Higgs Signal and Which Do Not?

15 of the 20 plots shown yesterday (in a larger figure that will be easier for you to read) have a Higgs signal; the others, marked with a red line in Figure 1, do not. That probably wasn’t too hard, except for the uppermost plot in the right column, because the plots without a Higgs have close to 40 events while those with a Higgs have close to 60; and the discrepancy is especially big in the range 120-130 GeV/c2 where I told you that the Higgs is to be found.  Some people would have been tripped up by the peak in the bottom left plot, were it not for the fact that I told you the Higgs particle couldn’t be there.

Now on to the next question: What, roughly, is the Higgs mass in each of the 15 plots that have a signal.

Estimate the Higgs’ Mass

This is generally what trips people up the most.  The human brain is an absolutely terrible statistician.  It takes years of practice to unlearn what your brain wants to tell you — graduate students, and even senior theoretical physicists who haven’t stared at enough data, do an awful job with a problem like this.

For example, take Figure 2, which is one of the plots I showed yesterday.  [Recall that the bins are 1 GeV/c² wide, and the bin just to the right of the 2 in "125" is the bin running from 125 to 126 GeV/c².]  Your eye wants to tell you there’s a big peak at 126-127 GeV/c² and so that’s where the Higgs mass should be.  But you’ve been warned that the Higgs peak is 4 GeV/c² wide at half its maximum height — so it shouldn’t just be a one-bin wide spike!  Still, your brain forgets this and your eye misleads you.  If you look carefully, there are rather few events to the right of the highest bin, and more to the left.  The real Higgs mass is quite a bit lower than 126-127 in this case.

Fig 2: One of the 15 plots with a Higgs signal.

Fig. 2: One of the 15 plots with a Higgs signal.

Now what about Figure 3? This looks like it has a double peak! It has one peak between 123 and 124, and another between 126 and 127! But I assure you this was generated by a signal that has a single peak. Your eye, yet again, is fooled. The real Higgs mass in this case lies between the two peaks.

Fig. 3: Another of the 15 plots with a Higgs signal.

Fig. 3: Another of the 15 plots with a Higgs signal.

And Figure 4? This case doesn’t look too hard; there are several events in the 125-126 bin and several in the 126-127 bin, just 2 in the 127-128 bin, and none in the 124-125 bin. So clearly the Higgs mass should be between 125 and 127, probably 126.  But in fact the actual Higgs mass in this case lies in the empty bin!

Fig. 3: A third of the 15 plots with a Higgs signal.

Fig. 4: A third plot with a Higgs signal.

The Truth is Revealed

Well, as the experts who are looking closely will already have guessed, every single one of these 15 plots was generated with a Higgs mass of 124.7 GeV/c2. Every one.

If there were 400 times as many events as in each of the plots from yesterday, here’s what you would see; three of the plots in Figure 5 show what the data looks like with a Higgs peak, and the fourth shows what the random background looks like.

Fig. 5: All the plots that contained a Higgs signal had a Higgs at 124.7 GeV/c2.

Fig. 5: All the plots that contained a Higgs signal had a Higgs at 124.7 GeV/c2, whose shape (with 400 times the amount of data) is shown in three of the above figures, above a flat background. The background alone is shown in the upper right plot.

I assure you there was no funny business in generating the plots from yesterday.  No bias was introduced. Try it yourself if you don’t believe me! The computer program was designed to sample this flat background plus smooth peak at random. When you pick only 20 or so events from such a peak, and another 40 or so from the flat background, you’ll get something that looks quite squiggly. The peak in the resulting plot can easily lie one or even two GeV/c2 away from the true mass, as in Figure 2; there can appear to be two peaks, as in Figure 3; the bin with the true mass can actually be empty, as in Figure 4. And the probability of the plot looking weird in some way, given this small amount of data, should not be underestimated — it’s maybe one in four or five.

The Implications

What’s the point? The photon-based and lepton-based measurements of the Higgs mass at ATLAS differ by 3 GeV/c2, which is bigger than we’ve seen here. So what?  The photon-based mass measurement is 126.6±0.3±0.7 — the first uncertainty number is due to random statistics, the second is called “systematic” and includes experimental defects and other problems.  So it has a systematic uncertainty of nearly a GeV/c2 (which is always ignored by the press, as though uncertainties don’t matter in interpreting what you actually know).  Systematic uncertainties are often not random; they may give an overall shift to all the data, moving the peak uniformly up or down.  And the lepton-based measurement is 123.5±0.9+0.4-0.2 GeV/c², which has an upward systematic uncertainty of almost half a GeV/c2. So if ATLAS got unlucky and their lepton-based result got pulled down by a couple of GeV/c2 purely due to the statistical effects we’ve seen today, and if they have small systematic problems of nearly a GeV/c2 in one or both of their measurements, they could potentially get a 3 GeV/c2 discrepancy.

Is such a big discrepancy likely? No; it is certainly possible, but it is certainly not very likely.

But here you have to remember how statistics works. If instead of asking “is this particular weird phenomenon likely” you had instead asked “is it likely that somewhere, in all of the measurements that ATLAS is making about the Higgs, one or two of them would have turned out weird”, the answer is “very likely indeed!”  I’m writing and you’re reading a post about the mass measurement — and so are other bloggers and Scientific American and all the rest — because currently this is the one that looks odd. It could instead have been that there was a double peak in the photon-based plot, or that the lepton-based plot was showing no peak at all (simply because the signal rate fluctuated down from 20 events to 11, which is just 2 standard deviations.) Or the measurements of the spin and parity of the Higgs could be looking strange.  In that case we’d be talking about those things instead.  In July people were talking about the hint that the Higgs didn’t seem to be decaying to tau leptons; well that’s over and done.

In other words, there’s a tremendous bias both among scientists and the news media. We always talk about the outliers, the things that look odd to us. We make a big deal about them, and of course, to some extent we should. But we often forget that although these outliers look unlikely,

  • they aren’t generally as unlikely as they look to our brains, and
  • the probability of something being an outlier, when many things are being measured, is not small.

I personally think it would be enormously helpful if scientists would remind the news media of this well-known fact, and if the news media would convey it to the public. It would help the public understand how science is really done — and explain why it is so very common that the big story that you read about on the science pages simply disappears, after its 15 minutes of fame, without leaving a trace.

The most interesting outlier is ATLAS’s hint that the Higgs decays more often than expected to two photons; on this everyone (including the Scientific American journalist) agrees. We had that hint last December, in July, and still now.  However, although CMS saw something somewhat similar in July,  they’ve (somewhat disturbingly) delayed making their most recent data public on this measurement.  So right now there’s still no way to know if this is more than a fluke, because the ATLAS result by itself is still not statistically significant.

Can You Find The Higgs-Like Particle?

Ok, everyone; by now you’ve all learned that the ATLAS experiment at the Large Hadron Collider [LHC] announced recently their measurement of the mass of the Higgs particle, in its decay to two photons, is at 126.6±0.3±0.7 GeV/c²; and meanwhile they measure the mass for apparently the same particle, in its decay to two lepton/anti-lepton pairs, to be 123.5±0.9+0.4-0.2 GeV/c², about 3 GeV/c² lower. “So bizarre”, wrote Michael Moyer at Scientific American,  bandying about the idea that there are two Higgs-like bosons in this data (though, having pointed out the ambulance to you, and neglecting also to mention CMS’s data from November that directly disfavors this interpretation of the ATLAS data, he tells you later that some physics bloggers say you shouldn’t chase it…)

Well.  How bizarre is this 2.7  standard deviation discrepancy really?

At the end of this post are 20 plots showing randomly generated data, in amounts comparable to those used in the current measurement of Higgs decaying to two lepton/anti-lepton pairs (often called “four leptons” for short). [Warning: This certainly hasn't been done with the level of care needed to match the ATLAS measurement in any precise way; I'll say a bit more below about the caveats.]

  • In some of the plots, there’s just a random flat background of about 40 events, similar (though not identical) to what arises in the four-lepton Higgs measurement.
  • In some of the plots, there’s a Higgs-like peak of about 20 events — a very sharp peak with a perfect detector, but one which is smoothed out a bit by the inevitable imperfections in a real particle detector.
  • In each plot with a peak, the mass of the Higgs has been chosen to lie somewhere between 122 GeV/c² and 127 GeV/c² [not equally populated].

So:

  1. Can you tell which plots have a Higgs-like signal, and which ones don’t?
  2. In each plot where there is a signal, can you estimate the Higgs mass? Again, in each plot, it lies somewhere between 122 and 127 GeV/c².  You’re not going to get it exactly right — that’s impossible — but do your best, and let’s see what happens.

A couple of additional comments to help you:

  • The resolution on the measurement (i.e., the effect of imperfections in the measurement) is such that with infinite amounts of data, the peak that you’d observe would be a bump whose width, at half the bump’s maximum height, would be about 4 GeV/c².
  • The bins in each plot are 1 GeV/c² wide; the bin just to the right of the number 2 in “125” runs from 125 to 126, the next from 126 to 127, and so forth.

Lastly, a caveat: in the real ATLAS or CMS measurement, the mass of the Higgs is not measured simply by fitting a Gaussian peak over a flat background. So don’t take this exercise too seriously! It’s just a useful learning experience, and nothing more.  What the experiments  actually do is far more sophisticated, accounting for the properties of each event separately!!

Here we go: Good Hunting!  (If your browser has trouble with the figure, try clicking on it.)

[Solution is now available.]

Twenty sets of simulated data, showing number of events versus the “mass” of a putative particle, in GeV/c2. Each contains about 40 events of a non-Higgs-like background that is flat in mass.  Some but not all plots contain a signal which is in the form of a peak, comparable to the current size of the expected Higgs particle signal (about 20 events). In each case the mass of the Higgs is chosen between 122 and 127 GeV/c2.  Experimental imperfections make the full-width of the signal peak at half its maximum about 4 GeV/c2 wide.

The Constancy of the Heavens — Verified Anew

This is a post about constancy and inconstancy, one of my favorite topics.  And about how alcohol can make you smarter.

There are many quantities that we call “constants of nature”.  Of course, anything we call a “constant” is merely something that, empirically, appears to be constant, to the extent we can measure it.  Everything we know comes from observation and experiment, and our knowledge is always limited by how good our measurements are.

We have pretty good evidence that a number of basic physical quantities are pretty much constant.  A lot of evidence comes from the constancy of the colors of light waves (i.e. the frequencies of waves of electromagnetic radiation) that are emitted by different types of atoms, which appear to be very much the same from day to day and year to year and even across billions of years (neat trick! will describe that another time), and from here to the next country and on to the moon and to the sun and across our galaxy to distant galaxies.  For example, if the electron mass changed very much over time and place, or if the strength of the electromagnetic force varied, then atoms, and the precise colors they emit, would also change.  Since we haven’t ever detected such an effect, it makes sense to think of the electron mass and the electromagnetic force’s strength as constants of nature.

But they’re not necessarily exactly constant.  One can always imagine they vary slowly enough across time or place that we wouldn’t have noticed it yet, with our current experimental technology.  So it makes sense to look at very distant places and measure whatever we can to seek signs that maybe, just maybe, some of the constants actually vary after all.

[I wrote a paper in 2001 with Paul Langacker and Gino Segre about this subject (Calmet and Fritzsch had a similar one).  This followed the observational claims of this paper (now thought false) suggesting the strength of electromagnetism varies across the universe and/or with time.  A lot of what follows in this post is based on what I learned writing that old paper.]

Suppose they did vary?  Well, the discovery of any variation whatsoever, in any quantity, would be a bombshell, and it would open up a door to an entirely new area of scientific research.  Once one quantity were known to vary, it would be much more plausible that others vary too.  For instance, if the electron mass varies, why not the W particle’s mass, which affects the strength of the weak nuclear force, and thereby radioactivity rates and the properties of supernovas?  If the electromagnetic force strength varies, why not that of the strong nuclear force?  There would be interest in understanding whether the variation is over space, over time, or both.  Is it continuous and slow, or does it occur in jumps?  One can imagine dozens of new experiments that would be proposed to study these questions — and the answers might reveal relations among the laws and “constants” of nature that we are currently completely unaware of, as well as giving us new insights into the history of the universe.

So it would be a very big deal.  [Though I should note it would also be puzzling: even small variations in these constants would naively lead to large variations in the "dark energy" (i.e. cosmological "constant") of the universe, which would potentially make the universe very inhomogeneous.  However, we don't understand dark energy, so this expectation might be too naive.] Since there’s no story about it on the front page of the New York Times, you can already guess that no variation’s been found.  But a nice new measurement’s been done. Continue reading

Dark Matter Around the Corner?

The meaning of the title of Clara Moskowitz’s new article for the public, “Dark Matter Mystery May Soon Be Solved“, all lies in the word “may”.  It may.  It may not.

According to the article, “the answer to this cosmic mystery could come within the next three or four years, scientists say.”

I have to admit that this kind of phraseology, which one often sees in the press in reports about science, drives me a bit nuts.  Which scientists? How many of them?  You can’t tell from this line whether this is something that a group of three or four mavericks are claiming, or whether it is conventional wisdom shared by most of the community.   And “the answer… could come…”? Interpreted literally it is content-free: yes, the answer could come in the next few years, or not — but you don’t need any scientists to tell you that.  If one interprets it more optimistically — that it is intended to imply that the answer will very likely come within the next three or four years — then I think it is far from clear what fraction of the experts will agree with that statement.

Rather than debate the claim, let’s start with the physics.  What will determine how long it takes to discover what dark matter is made from? Continue reading