Streamlining the ‘Pythagorean Theorem of Baseball’

Mathematicians test simplified formula to predict winning baseball percentages

Is your local Major League Baseball team better than its record suggests? Math researchers are considering alternatives to the Pythagorean Theorem of Baseball, devised by baseball statistician Bill James. Introduced in the 1980s, the “theorem” predicts the winning percentage of a baseball team based on how many runs the team scores and how many runs it allows.

Websites, including ESPN’s, often include the Pythagorean prediction of a team’s winning percentage during the season. Fans compare the Pythagorean prediction with the actual winning percentage to judge whether a team is under- or overachieving.

When a team scores fewer runs than it allows, the Pythagorean model predicts that the team should have a losing record. For the 2001 season, the New York Mets allowed more runs than they scored and had a winning record; they did much better than the Pythagorean model predicted. So they can be considered an overachieving team. Because the Colorado Rockies scored more runs than they allowed but had a losing record, they were possibly an underachieving team.

Now, Michael Jones and Linda Tappin of Montclair State University in New Jersey have devised mathematically simpler alternatives to the Pythagorean Theorem of Baseball.

To predict the winning percentage of a team, one new model simply uses a little addition, subtraction, and multiplication. It starts with the total runs scored by the team in all its games (Rs), subtracts the runs it allows (Ra), and then multiplies the difference by a number called “beta” (B), which is chosen to produce the best results. For the 1969-2003 seasons, the optimal values of B range from 0.00053 to 0.00078, with an average of 0.00065.

Adding 0.5 to the result gives the predicted winning percentage of the team. The resulting formula looks like this:

The estimated winning percentage, P = 0.5 + B*(Rs-Ra)
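As a minimal sketch, the linear formula can be written in a few lines of Python. The function name and the run totals below are hypothetical, chosen only for illustration; the default beta is the 1969-2003 average quoted above.

```python
def linear_win_pct(runs_scored, runs_allowed, beta=0.00065):
    """Estimate winning percentage with the linear model P = 0.5 + B*(Rs - Ra).

    beta defaults to 0.00065, the average optimal value reported
    for the 1969-2003 seasons.
    """
    return 0.5 + beta * (runs_scored - runs_allowed)


# Hypothetical team (not real data): 800 runs scored, 700 runs allowed.
print(f"{linear_win_pct(800, 700):.3f}")  # prints 0.565
```

A team that scores exactly as many runs as it allows gets a prediction of 0.5, a break-even record.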

Because they use only addition, multiplication, and subtraction, these formulas are known as “linear functions,” the simplest kind of equations in mathematics.
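One practical consequence of linearity is that a best-fitting B can be computed directly. The sketch below assumes that “best” means least squares, which is an assumption made here and not necessarily the criterion Jones and Tappin used. The function name is hypothetical, and the data are synthetic values generated from B = 0.00065 so the fit can be checked by eye.

```python
def fit_beta(run_diffs, win_pcts):
    """Least-squares estimate of B in P = 0.5 + B*(Rs - Ra).

    The intercept is held fixed at 0.5, so the best-fit slope is
    sum(d * (p - 0.5)) / sum(d * d), where d = Rs - Ra.
    """
    numerator = sum(d * (p - 0.5) for d, p in zip(run_diffs, win_pcts))
    denominator = sum(d * d for d in run_diffs)
    if denominator == 0:
        raise ValueError("need at least one nonzero run differential")
    return numerator / denominator


# Synthetic (not real) data: run differentials and winning percentages
# constructed to lie exactly on the line with B = 0.00065.
run_diffs = [100, -50, 20, -70]
win_pcts = [0.565, 0.4675, 0.513, 0.4545]

beta = fit_beta(run_diffs, win_pcts)
print(f"{beta:.5f}")  # prints 0.00065
```

With real season data the points would scatter around the line, and the fitted B would vary from season to season, as the reported range of 0.00053 to 0.00078 suggests.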

In contrast, the original Pythagorean Theorem of Baseball is more complex. It uses exponents: runs scored and runs allowed are squared, that is, raised to the second power. The resulting formula is: P = Rs²/(Ra² + Rs²)
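For comparison, here is a minimal sketch of the Pythagorean formula in Python. The function name and the run totals are hypothetical and used only for illustration.

```python
def pythagorean_win_pct(runs_scored, runs_allowed):
    """Estimate winning percentage as P = Rs^2 / (Ra^2 + Rs^2)."""
    rs_squared = runs_scored ** 2
    ra_squared = runs_allowed ** 2
    if rs_squared + ra_squared == 0:
        raise ValueError("runs scored and runs allowed cannot both be zero")
    return rs_squared / (ra_squared + rs_squared)


# Hypothetical team (not real data): 800 runs scored, 700 runs allowed.
print(f"{pythagorean_win_pct(800, 700):.3f}")  # prints 0.566
```

For the same hypothetical team, the linear formula with B = 0.00065 gives 0.5 + 0.00065*(800-700) = 0.565, so the two estimates differ by about one thousandth.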

The equation gets its name because of its similarity to the Pythagorean Theorem in geometry, which relates the lengths of the sides in a right triangle as a² + b² = c², where a and b are the shorter sides and c is the longest side (the hypotenuse).

Because the Pythagorean theorems use exponents, these formulas are “nonlinear” equations, which are generally more complex than linear formulas.

So was the original Pythagorean Equation of Baseball needlessly complicated? Does the linear equation do just as good a job?

For the baseball seasons from 1969 to 2003, the linear formula predicts almost as well as the original Pythagorean theorem, Jones and Tappin reported at this winter’s Joint Mathematics Meetings in Phoenix. The one real exception is the 1981 season, when there was a baseball strike.

While Tappin and Jones have only analyzed whole seasons with their new formula, they are exploring how well it works for seasons-in-progress. If their formula meets with continued success, you may soon find it on your favorite sports website.

Media Contact

Ben Stein EurekAlert!

More Information:

http://www.aip.org/
