Human Knowledge Doubles

The question is how quickly. Here's yet another summary of what is in this weblog. Since April, we have shown that primary evidence of human knowledge, as shown in patents and academic publication, doubles at different rates for different sectors, ranging from 2 years for nanotechnology to 21 years for other sectors.

Secondary evidence, such as the number of university students and funding for research, varies by country. The OECD has reduced government funding for research (and the growth in patents and students in scientific disciplines) has dropped to zero. China and India, among others, are funding research and encouraging students to enter scientific fields, and they are seeing rapid gowth in publications, patents, research funding and students.

But the growth of Asian knowledge engines has not yet replaced the flattening of the growth curve in the developed world. This has led to uneven growth in knowledge--in sexy sectors like nanotechnology, growth is very rapid. In other areas, such as Alzheimer's Disease, it is not as rapid as one would think.

04/25/2009

A New Forum For My Writing

Hi all,

As you may have noticed my output has slackened. This is not due to a lack of interest in the topics covered here, but rather to the fact that I have started as a columnist for Examiner.com. I will be writing more frequently there, as I actually get paid to do so.

I welcome your visit there--bookmark me and check back. I get paid by number of visits...

Here's my Examiner home page.

I will continue to post on this weblog--and will probably get back to a semi-regular schedule after the newness of the new toy wears off a bit. Besides, some of the stuff I write here probably wouldn't make it into a daily newspaper column, right?

07/21/2007

Identity Management and Human Knowledge

Identity Management (IDM) is a discipline that tries to supervise how passwords are protected and verified, how IT departments decide what files you have access to, etc.

It's a new field--they're still debating the name, they haven't really defined the limits, and they don't know where the concept should sit inside organisations. Perfect for what we're doing here. We've looked at very long time series, for things such as Mars and Alzheimer's Disease and battlefield injuries. We've looked at timelines for fairly new fields, such as nanotechnology. Here is a field that is just being born.

The first academic work starts to appear in 2000, according to Scirus, and there are only 612 academic publications with the phrase "identity management" in the title, abstract or text overall. Truly nascent.

There have been 1,172 patents filed with the phrase "identity management" in the title, abstract or text overall.

In the extended post below, I show the figures and explain a bit more about how I calculate the compound annual growth rate, and show an alternative means of calculating it.

Using the method I've employed throughout this weblog, the growth rate for academic publications is 34.93%, which would double every 2.31 years, and the growth rate for patent applications is 32.93%, which would double every 2.44 years. and I would then write 'Hooray! We have another winner!'

But the alternative means of calculating growth would look even better, showing growth in academic publications of 71.54% and a doubling time of 1.28 years, and for patents, CAGR of 71.37% and a doubling time of 1.29 years.

I'm going to stick with my method, but if you want to recalculate all of the figures, the tools to do so are here. Click to see the numbers and the explanation.

Continue reading "Identity Management and Human Knowledge" »

Two New Sectors To Measure

The project that called me away from this explored the growth of two new 'disciplines'--identity management and information assurance. Both are attempts to put strategic management protocols around all the work that is going on in subsectors relating to passwords, permissions, hand-offs of responsibilities between websites and organisations, etc. They're both new, and they both deal with subsectors that they want to 'control' (i.e., have oversight responsibility for). So I've been exposed to information on the growth of both of them and other sectors as well, such as biometrics.

The next three posts will look at knowledge growth in these sectors. Since two of them were essentially invented after 9/11, the time series will be blessedly short.

07/20/2007

Well, I had a plan...

I have been hijacked away from this project for two months, working on a very interesting project called Blindside, which had the benefit of having a paycheck attached to it. It also is tangentially related to what I am doing here, as my next few posts will show.

No reason why you should care about any of the above (but do follow the link if you're interested in how technology relates to UK government, information assurance issues and identity management), but it has caused me to rethink my scheduling for this. I'm going to do more primary research--maybe through the summer--and then tie it up with a couple of really brilliant think pieces and try to sell it as a book.

I'd like to welcome the flood (well, trickle) of readers who have actually found this weblog--I assume it's academic assignments that lead you here. I hope you feel free to comment or ask questions.

Anyway, back to work.

06/03/2007

Real World Consequences of These Issues

Let's try to look at real world effects of this dangerous illusion we are discovering.

The illusion is that a rapid growth in human knowledge (doubling every five years) is happening all around us, naturally and painlessly. As we hope to have shown on this weblog, human knowledge about a particular subject can increase that fast or faster, but it is always as a result of putting money, institutional resources, interested and bright people to do the research, together. It doesn't hurt to have an urgent public need (HIV) or potential private gain (nanotechnology) as a carrot.

But where these inputs are not provided, human knowledge about a subject grows at a much slower pace. Let me explain what I mean.

Click below for more...

Continue reading "Real World Consequences of These Issues" »

06/02/2007

Update and Summary

We are still 5 days short of our two-month anniversary on this weblog. I am going to try and do some more data mining before the 7th of June, but shortly after that date I am going to turn towards deeper analysis of data already collected. After the 7th, I will be doing research aimed at clarifying points brought up during the first two months' work.

What have we learned so far?

  • Only 10 of 25 sectors explored are growing at a rate that would double every 5 years
  • In areas that are heavily favoured by government or private industry, a sector can grow even faster than a 5-year doubling rate. But in areas that are not so favoured, growth ranges from a 10-15 year doubling rate (typically)
  • In the U.S. this is correlated with an erosion of federal funding for research
  • Growth in Asian knowledge engines (university enrollment, funding, patenting) does not overcome a decline in OECD states. It tracks GDP growth, and appears to be at a level that will barely sustain development in their domestic economies
  • The increase in the availability of existing knowledge to more people is dramatic, and perhaps may compensate for the less-than-hoped-for growth in new knowledge.

So let's see what happens next.

05/28/2007

More on U.S. Federal Funding for Research

Okay, as promised--here are the growth rates and doubling periods for the subcategory disciplines for U.S. federal funding. They are taken from this spreadsheet: Download us_funding_for_research3.xls, which is found here on the NSF website, where it is labeled Table 1A.

There's a real shocker below (to me, anyways). I put the charts up twice--once using the straight figures given by the NSF, and again after adjusting for inflation.

If anyone wants to know why the U.S. research output has declined, look at the second one. In 14 of the 26 disciplines measured, inflation-adjusted growth is negative. Only two categories show growth that is not embarrassing--computer sciences and 'other medical.'

Elsewhere we have seen that the number of the student-age population has declined (although higher proportions are enrolling). We have heard that our proportion of the contribution to the increase of knowledge is bound to decline, given the growing contribution from other countries such as China.

It's all bull. The U.S.A. defunded research. Click to see the numbers. Read 'em and weep.

Continue reading "More on U.S. Federal Funding for Research" »

The Future of Blogging

Well, I'm going off topic again. I've been doing this for a little less than two months, and I have some observations about the process I'd like to share.

First, I think a weblog is the most appropriate forum for me to conduct this research. I will probably test the waters to see if I can get a book out of this, but the ability to hyperlink, the opportunity to write conversationally as I work, the tagging feature of categories (as opposed to the rigidity of chapters), all have made this an easier job for me--something I can do nights and weekends--rather than a herculean labour.

Second, after everybody gets tired of using weblogs to recount their previous night's drinkng exploits, I think that efforts such as this will become the best use of weblogs. I'm not doing anything here that you could not do, if you are reading this. I've put up my principal sources of information and the major tools I use for reference, and if you want to do your own work or fact-check mine-or even take it further, I'll be happy to help in any way I can.

Lastly, as a non-academic (but one who has been exposed to a lot of academic publishing), I recognise that the work I'm doing here is not sufficient to be authoritative. However, I'm trying to ensure that if this stimulates the conversation on the topic adequately for institutional investigation, any researcher will be able to start from a higher point than if I had not done this. And you know what? That's enough for me.

Funding Frustration

Well, the previous two posts give a fairly good picture of how the U.S. federal government funds research (note that we did not examine state and local funding, nor private sector funding of R&D, nor defense funding, for that matter).

I feel satisfied that I know what's happening in the U.S. But my real frustration is that you, dear reader (icy ironic tone, please) will quite reasonably expect me to find this out for the world.

But while it only took 7 hours to compile the information for the U.S. (due to better and more available statistics), it will take weeks to get a picture of what's happening in the world.

I think I'll put a tip jar on this blog.

U.S. Research Funding by Sector

I think the real problem with this post will be presentation. I have an Excel spreadsheet that shows U.S. federal funding for reseach by sector, and I want to show the growth rates and doubling times. However, the time series for some sectors differs, for a variety of reasons. So, please be patient with me.

I will put the figures in the extended post below, as well as some analysis, and report the topline findings here. In a later post I will do the same for subcategories. I think it will look messy. The purpose is to see what choices are being made, so we can compare funding differences to outputs--the numbers of academic publications and patents in each field.

Mathematics and computer sciences have seen the greatest growth, with a CAGR of 10.54% and a doubling time of 6.92 years. Strangely, physical sciences (including astronomy, chemistry and physics) fared second worst, with a CAGR of 5.14% and a doubling time of 13.83 years. However, it did start with the second highest base total, if that's any justification... Slowest growth in federal research funding was experienced in social sciences, with a CAGR of 4.82%, which gives a doubling time of 14.72 years.

Update: I had to recalculate all the stats to account for inflation, converting all $ figures to 2006 USD. It changes everything...

Continue reading "U.S. Research Funding by Sector" »

05/27/2007

Into The Jungle Of Research Funding

Oh, this hurts. I did not want to go into research funding. But now that it is fairly clear that the rate of progress varies significantly by sector, we have to examine the inputs.

The first relevant document found is a PDF from the European Commission. It states that it wants the EU to increase research 'efforts' to 3% of GDP by 2010. Ambitious, that. The document talks about the importance of 'basic research,' as opposed to directed, I assume, which is encouraging. It actually serves as a good basic primer on how funding is structured in Europe, the U.S. and Japan, but while it talks about funding for specific programmes, it doesn't talk about levels of funding at the national or international level. So we move on.

Curious Cat, a blog that bills itself as a Science and Engineering blog, posts research funding for U.S. academic R&D for the past few years (see below in the extended post). More importantly there's a link to the NSF where we find an Excel spreadsheet (also in the extended post below) that charts all federal spending on research for science and engineering between 1970 and 2003. There are other charts available for viewing and downloading here.

In 1970, combined spending by all U.S. Federal agencies for research in science and engineering totaled $4.9 billion (USD). By 2003, that had grown to $53.3 billion (USD). Not bad.

The compound annual growth rate for U.S. Federal funding for research into science and engineering is 7.28%, which means it doubles every 9.86 years.

However, adjusting for inflation to 2006 U.S. dollars, it reads a bit differently. In 2006 dollars, the spend in 1970 was the equivalent of $26 billion, while the 2003 spend rises on recalculation to a total of £58 billion. That provides a CAGR of 2.39%, which would double every 29.35 years. A bit sad, that.

But the spreadsheet breaks spending down by discipline, as well. I'll address that in the next post.

Continue reading "Into The Jungle Of Research Funding" »

League Tables and Update

Here is the league table for 25 points of measurement of the growth of human knowledge. As shown, only 10 of the 25 grow faster than 14.87% per year, which is what is needed to double every 5 years.

Field                                                            CAGR                   Doubling Period (Yrs)
1. Nanotechnology patents                                  44.91                           1.87
2. Nanotechnology journals                                 42.03                           1.98
3. Global warming patents                                  38.62                            2.12
4. Prions patents                                                33.76                            2.38
5. Programming patents                                      33.53                            2.4
6. Stem Cells patents                                           26.47                            2.95
7. Prions journals                                                 25.57                            3.04
8. Global warming journals                                    24.71                            3.14
9. Epidemiology patents                                        17.37                             4.33
10. Stem Cells journals                                          16.63                             4.51
5 Year Doubling Rate                                          14.87                             5
11. Programming journals                                     12.55                             5.86
12. Alzheimers Disease patents                              11.26                             6.5
13. Oncology patents                                              10.02                             7.26
14. Alzheimers Disease journals                                9.65                             7.52
15. Oncology journals                                                9.23                             7.85
16. DeSolla Price estimate of world literature growth   7.0                             10.24
17. Epidemiology journals                                           6.22                           11.49
18. Mars journals                                                        5.78                           12.34
19. Shale oil journals                                                   5.53                           12.88
20. US patent grants                                                    5.21                           13.65
21. University enrollment worldwide                              4.85                           14.64

22. Publications in astrophysics since 1970                     4.0                             17.67
23. US patent applications                                             3.88                           18.21
24. U.S. Book publishing                                                3.65                           19.33
25. Shale oil patents                                                      2.58                           27.21

More discussion below--click if you want to see it.

Continue reading "League Tables and Update" »

05/26/2007

So, Does Human Knowledge Double Every 5 Years?

1. No,

2. But it could

3. Because we see that it does in sectors where we pour in human and financial resources.

However, the Western world is not pouring in human and financial resources the way it used to. Students are choosing law, business and web design instead of engineering and science. Government is encouraging private sector R&D (and encouraging universities to work closely with business), but it doesn't happen enough and when it does, it is focused on very short term proejcts that improve a bottom line in the next quarter.

Absent the commitment of extra resources, it looks like human knowledge doubles every 10-15 years. I'll be showing my work in the next post, and then supplementing it with information about the commitment of resources, an unwelcome next task.

But where there is a clear incentive, such as nanotechnology or global warming (where the incentive may be just getting a project funding by slapping a green label on it), we get the output that shows that our knowledge and exploitation (via patents) of a given sector can double every five years.

Continued below--click if you want to see a bit of a rant...

Continue reading "So, Does Human Knowledge Double Every 5 Years?" »

Stem Cells

Searching for "stem cells" or "stem cell" on Scirus returns 995,702 results, of which 152,648 are journal publications with the term in the title, abstract or contents, and 62,220 patents. According to Wikipedia, Stem cells were postulated in the early 1960s and shown to exist in 1963, giving us a 40-year window to chart the progress of human knowledge. However, looking at Scirus, I find articles as early as 1916 that seem relevant, and a patent in 1947 that also looks relevant to a layman.

Starting in 1960 excludes 74 journal articles and 2 patents, so we will make that our base year. That gives us a Compound Annual Growth Rate (CAGR) of 16.63%, or enough to double human knowledge about stem cells every five years. Hooray! We have another winner. That actually would double every 4.5 years. For patents, the CAGR is higher, 26.47%, which would double every 2.95 years. (Just for fun, if I used a base year of 1947, when the first patent was filed, the CAGR is still enough to double human knowledge every five years, although it drops to 20.2%).

Perhaps the most important finding is that President Bush's restrictions on stem-cell work in the U.S.A. has not dragged progress down, with over 100,000 journal publications and over 56,000 patents filed since he took office. Of course, I'm not counting how many of each are American, but that's for domestic policy wonks to debate... or even research, if they have the time.

As in some (not all) other sectors, we see that the quickest growth for academic publications is early, while patents explode just about the time that growth slows in academia. I'm not sure that's a coincidence...

Again, we see that when the incentives are properly allocated, we can double human knowledge in a sector every five years.

Click to see the figures below.

Continue reading "Stem Cells" »

05/22/2007

That Eureka Moment--University Students Worldwide

I have been searching for this for quite a while. UNESCO had it--click here to see it on their website. I include the graph here--click on the image to see a larger version.

Tertiary_education  The number of tertiary education students has grown from 68 million in 1991 to 132 million in 2004. That's a doubling over a 14 year timespan. That's a compound annual growth (CAGR) rate of 4.85%, which would lead to a doubling every 14.64 years. It's a third of the growth rate we would need to double every five years.

This is daunting. We see the world university student population mirroring the growth rate of India since 1947, when what I think everyone assumes is a growth rate similar to China's.

I don't think this is adequate growth to sustain a required expansion of conventional infrastructure, let alone power a Singularity.

05/20/2007

Alzheimers and Human Knowledge

Alzheimer's disease destroys human knowledge, but perhaps more importantly, it decoheres the personality that organises this knowledge. It is only human knowledge that can defeat this destroyer of human knowledge. It has been described,  treated (unsuccessfully) and researched for a century.

Let's see what Scirus tells us about the growth of human knowledge (and the commitment of research resources) on this. (BTW, I recently had another look at the sources Scirus uses, and it includes PubMed, so I won't be replicating this search there.) (And I got a quick scare by starting with the search term 'Alzheimers,' which only returned 1,428 results--if you're fact-checking me, and I hope to God someone is, use 'alzheimer.')

A search for Alzheimer on Scirus returns 1,274,572 results on Scirus, of which 112,401 are journal publications containing the term in the title, abstract or text. The search also returned 67,900 patents.

In the extended post, I will give the results by year and by five-year period. That's a Sunday morning for you--hot coffee and a database to search.

Okay--we'll assign 1920 as the base year for journals and infer 4 journal articles published that year, giving us a CAGR (Compound Annual Growth Rate) of 9.65%. This would indicate that human knowledge regarding Alzheimer's Disease doubles every 7.52 years.

However, the first patent was filed in 1976, which gives us a CAGR of 11.26%. This would double every 6.5 years. So, although work proceeds at a good rate and human knowledge is increasing, there is nothing like the growth rate seen in areas such as nanotechnology or global warming.

Again, we see that patent growth, starting at a later date, grows much more quickly than academic publishing. I am personally mystified as to why research into prions has grown twice as quickly as research into Alzheimers. Many more people suffer from Alzheimers than CJD, and the potential for both alleviating human suffering and reaping large rewards are much greater. The overall totals for Alzheimers research are greater than prions, don't get me wrong--but why is the growth trend favoring prions?

Continue reading "Alzheimers and Human Knowledge" »

05/19/2007

University Students Worldwide

The OECD charges for their statistical publications, but sometimes you can find a browsable 'Read It' version (sort of try before you buy?). In any event, I found a freebie copy of a 168 page PDF of their Focus on Higher Education document for 2005-2006.

I'm looking for numbers for enrollment, hopefully a time series. However, as I browse through I will put other relevant information here in bullet points. Bear with me--this is a work in progress.

  • There are more than 17,000 higher education institutions in the world.
  • 'In the past three decades, the proportion of young people enrolled in higher education has risen from 20% of the population to just over 50% on average in the OECD.(Angel Gurria, Secretary General of the OECD.)
  • In 2002, approximately 3.6 million researchers were engaged in R&D in the OECD (p. 46)
  • In 2003, 2.12 million students were enrolled in universities outside their country of origin (93% in the OECD) (p. 64). This has increased by 50% since 1998.

And that's all. Big disappointment. I have seen elsewhere the figure of 88 million university students worldwide--it was from a UNESCO high muckamuck, and I'll link to it when I re-find it. But what I need are annual figures... for the world... going back 1,000 years, if possible--hey, you could send me a link...

05/17/2007

UK Higher Education Numbers

According to this website published by Universities UK, In 2004/05, there were approximately 2.3 million students in the UK (Total undergraduate and postgraduate).

This chart shows growth from 1961 through 1995, going from 200,000 to 1 million in 34 years. It's evidently more than doubled in the past eleven years. Even looking only at this period, compound growth is 7.87%, which would double every 9.85 years. Looking at the entire period, CAGR is 7.02%.

Uk_students Click on the image to see a bigger version. Courtesy of University of Leeds

With the recent addition of student fees, I don't see hypergrowth kicking in any time soon. Hope the rest of the world will compensate.

Prions

It's back to Scirus to see trends in publications and patents regarding prions.

Prions are infectious agents composed only of proteins, and are implicated in BSE (bovine spongiform encephalopathy, AKA mad cow disease) and Creutzfeldt-Jakob Disease (CJD) which as far as I can tell is more or less a human equivalent.

Prions were hypothesized in the 60s and actually found in the 80s, winning a Nobel prize for the disoverer, Stanley Prusiner.

The human disease is quite rare. Mad cow disease is not. Both are fatal. There is a public good to be pursued in preventing CJD and private profit in preventing mad cow disease. The two together should stimulate research. With a short timeline to look at, it will be interesting to see if human knowledge doubles every five years.

Scirus returns 13,518 journal publications with 'prion' in the title, abstract or text, and 8,781 patents. Because there is a bird called the fairy prion (Pachyptila turtur), I will actually have to look at the results... darn.

We hereby proclaim a base year of 1981, when 5 journal articles and no patents were published. That gives us a CAGR (compound annual growth rate) of 25.57% and we have another winner! Human knowledge has doubled every 5 years regarding prions! And patents have grown even more rapidly, at 33.76%--huzzah!

However, as we have seen elsewhere, a note of caution--while patent growth has consistently exhibited hypergrowth characteristics, journal publications have leveled off, with the period 2002-2006 growing 'only' 66% over the preceding five-year period.

Click below to see the results.

Continue reading "Prions" »

05/16/2007

A Smart Science Fiction Writer Looks At The Future--And The Singularity

Charlie Stross looks at improvements in the speed of travel, information technology and bandwidth, says some very interesting things about convergence and the near future.

One idea just charms me--that our children will never have to worry about being lost, any more than we had to worry about being eaten by wolves. I suppose in about ten minutes I'll start thinking about the Big Brother implications, but these ten minutes are nice.

Hat tip Brad DeLong.

05/13/2007

Programming, Software and Computer Science

Computer programming has a lot of work and a lot of people centered around it. If the Singularitarians are correct, we're going to need them all.

After all, what good is it if computing speeds and memory keep doubling if we cannot program computers to do what we want them to do? And the Singularitarians want computers to do some pretty ambitious things. In fact, given their wish list, it seems clear that human knowledge of computer programming has to double every five years if we are going to get the magic box to do what we want it to do. We will need new programming languages, tighter coding, better algorithms, and quite possibly a new way of looking at programming. Is this happening?

We've seen elsewhere that the number of people engaged in computer sciences has grown to 180,000 new entrants per year in the U.S. alone--and considering the base date for this field, that is evidence of healthy growth. Let's use Scirus to see what this growth in professionals has produced in the way of patents and journal articles.

Preliminary results: between 2002 and 2006, 38,685 journal articles were published with the word 'programming' in the title or abstract. As the total is 145,404, only 26.6% of all journal articles were published in the last five years. Of the 367,971 patents noted on Scirus with 'programming' in the title or description, 201,548, or 54.7% were published between 2002 and 2006--so that has doubled. Not the first time we have seen differing growth rates for journals and patents.

For 'software,' the period 2002-2006 saw 224,696 journal articles published, 46% of the total of 486,262. For patents, as with 'programming,' the period was more robust--589,919 out of a total of 900,846, or 65% of the total.

The base year for 'programming' is 1940, when a UK patent was filed for improvements on a statistical calculating device. The compound annual growth rate for journal publications is 12.55%--good, but not good enough to double every five years (it is less than six years, to be fair). The compound annual growth rate for patents is 12.83%.

Remember that we need a CAGR (Compound Annual Growth Rate) of 14.87% to double every five years.

For 'software, the base year is 1965, when 5 journal articles were published and 1 patent was filed. And software is a winner, with CAGR of 22.83% for journal publications!  Hooray for coders everywhere! And a double huzzah to IBM, Microsoft, Sun and countless others, as patent growth since 1965 has a CAGR of 33.53%. Another winner!

As there are so many more patents than journal publications for both search terms, we should probably give more weight to patents, and be extremely pleased that human knowledge appears to have doubled in the past five years in this area, especially given the gloomy results reported in my previous post last week.

Click below to see results and analysis.

Continue reading "Programming, Software and Computer Science" »

05/12/2007

Outcomes

Advances in human knowledge should make life better. That's hard to measure. However, we can measure if human life gets longer (it does), healthier, richer (if not more rewarding), etc.

The rest of this post is philosophizing--feel free to skip. And in a previous post, I talk about the Outcomes Fallacy. But if you want thinking on the issue of human knowledge and outcomes, click to read more.

Continue reading "Outcomes" »

University Enrollment in China

If we start with the premise that university students are destined to either become creators or transmitters of human knowledge, large increases in their total numbers would augur well for future trends in the growth of human knowledge. (Obviously there's plenty of room for contribution by non-graduates--at least I hope so, as a non-graduate myself.)

If you are interested in the background of education in The People's Republic of China, I highly recommend this article in Wikipedia.

Here's an article by People's Daily, written in 2000, about China's plans to expand university enrollment. They cite the number of 2.04 million in 2000, and projected an increase of 1 million by the end of the year. But the article is really bad and really is not useful, other than as a signpost for further research. It did however, have the huge advantage of being the first return on a Google search for Chinese university enrollment.

The China Education and Research Network published a 2001 article writing about a goal to increase China's university enrollment to 15% (of 18-24 year olds) by 2010 from the then level of 10%.

A link at the bottom of the article led me to this announcement, which is a bit hard to decipher:

  • China's universities and colleges enrolled 2.6 million freshmen in 2001, 800,000 more than last year.
  • Officials with China's Ministry of Education said that China plans to increase the number of university students to 16 million in the five years to 2005, compared with a total enrollment of over 11 million in the 1996-2000 period.
  • According to a special plan for educational development, university students will account for 15 percent of school students in 2005. By that time, the number of postgraduates will reach 600,000.

So this supports, in a roundabout way, the first piece--2000 level of 10% with projected rise to 15%, but target date of 2005. Let's see if we can find out how they did...

In 1977, despite the fact that 5.7 million students took the uniform national examinations, there were only 278,000 university places available for them. What a waste... By 1985, (if I am reading the Wikipedia article referenced above correctly), that had risen to 560,000 places (there were also 550,000 places for professional and technical schools). And we see above that they enrolled 2.6 million in 2001. That's essentially a ten-fold increase in 24 years. Well done, China. But sadly, that's only a CAGR of 9.76%.

For science and engineering, we read that China is now putting out about 600,000 graduates per year. This does not appear to be a huge increase--Wikipedia tells us that there were more than 1 million applicants for science and engineering places in 1984, and we can kind of infer that there were about 200,000 accepted, a CAGR of 5.12%.

It very much resembles the growth in science and engineering found in the United States between 1950 and 2000. But it doesn't even replace that growth, let alone overcome the shortfall in U.S. (and OECD) science and engineering since 2000.

As so often happens, I find more relevant research after I've written all the above. (You get to see the work in progress, FWIW...)

The Chinese Ministry of Education does publish enrollment figures. I found out thanks to Wang Yibing, former UNESCO higher education specialist and part time professor at Zhejiang University. In this article (PDF), Wang shows the MOE figures for Institutes of Higher Learning from 1998 to 2006. I reproduce them below.

It shows that growth of post-graduate enrollment since 1998 (from 725,000 to 3,648,000) does show a CAGR of 22.38%. We have another winner! This is truly exciting--these are post graduates, after all. It almost makes up for the disappointment in U.S. patents found earlier in the week. Now we just have to hope that they don't get hijacked into designing toys for U.S. consumers...

Enrollment at Institutes of Higher Learning overall in China since 1998 also shows a 21.21% CAGR, again, more than sufficient to double every five years! Hooray, China! I am reluctant to carry this analysis back to 1977, because I believe my prior numbers separate universities and other institutes of higher learning, but we just have to root for another 5-10 years of growth.

Click to see the numbers.

Continue reading "University Enrollment in China" »

Trends in U.S. Knowledge Workers

This is another grab bag post, where I park interesting numbers for further exploration at a later date.

  • In the last 20 years of the twentieth century, the U.S. college-age population declined by more than 21%, from 21.6 million in 1980 to 17.0 million in 2000. The college-age population decline reversed itself in 2001, and will increase to 19.3 million by 2010 (a 13% increase over the year 2000 figure). Witty comment about how demography is destiny, please.
  • The private, for-profit sector is by far the largest employer of science and engineering workers. In 1999, 74% of scientists and engineers with bachelor’s degrees and 62% of those with master’s degrees were employed in private, for-profit companies. So we might see an explanation of why patents grow more quickly than academic publications--at least in some sectors...
  • Between 1991 and 2001, the number of jobs in the U.S. increased by less than 15% while science employment increased by more than 93%. Much of this can be attributed to the enormous growth in computer scientists. The number of computer systems analysts and scientists increased by 268%, or 1,135,000 people, during this period. The number of natural scientists increased by almost 33%. Within this category, medical scientists experienced the largest increase: 111%.
  • Within engineering, as in the period from 1991–2001, electrical and electronic engineering is projected to have the biggest absolute employment gain, up by 31,000 jobs, or about 11% between 2000 and 2010. Larger relative gains are expected for biomedical, environmental, and computer hardware engineers. Further declines in petroleum engineers, as well as mining and geological engineers, are projected. Employment for all engineering occupations is expected to increase by only 9.4% over this period.
  • The Bureau of Labor Statistics expects that the trends evident between 1991 and 2001 will persist. While the economy as a whole is expected to generate 15.2% more jobs over this decade, employment opportunities for scientists (excluding social scientists) and engineers are anticipated to increase by about 44%, amounting to an additional 2.3 million jobs.
  • Approximately 88% of the increase in science and engineering jobs will occur in computer-related occupations. Overall employment in these occupations is projected to grow by almost 69% between 2000 and 2010, with more than 1.9 million jobs being added.
  • Eight of the ten occupations expected to grow most rapidly between 2000 and 2010 are computer-related. The increasing demand for computer-related occupations reflects the rapid advances in computer technology and the continuing development of new computer applications, including the Internet and intranets.
  • Overall, computer specialists, a component of computer and mathematical occupations, are projected to increase by 68.6%, for example. Five computer occupations — computer software applications engineers, computer support specialists, computer systems analysts, computer systems software engineers, and network and computer systems administrators — also are among the occupations with the largest projected numerical job growth. Computer and data processing services is the economy’s fastest-growing industry, and in almost all industries, employment in computer occupations is projected to grow much faster than average.

Well, they might want to rethink their forecasting on petroleum engineers, but there is no indication that anyone expects hypergrowth in the numbers of science and engineering workers.

05/08/2007

Sobering News

The post below from yesterday, which details the stately, rather than exponential, increase in granted patents in the U.S., has really got me thinking, and I think we'd better report on the state of play here.

Let's detail those areas where we have seen adequate growth to double human knowledge every 5 years:

Update: We have since found other areas where the CAGR is sufficient to double human knowledge every five years:

Update 2: A modest review revealed errors in my calculations regarding oncology. Its position has changed (and not for the better).

Prions, where academic journal publications have increased at a CAGR of 25.57%, and patents have increased by 33.76%

Software, where academic journal publications have increased at a CAGR of 22.83% and patents have increased by 33.53%

Post-graduate enrollment in China, which has increased at a CAGR of 22.38% (although see below for more discouraging information about the growth of tertiary education worldwide)

(All percentages are CAGR (compound annual growth rate.)

1. Nanotechnology, with a blistering growth in publications of 97.16% per year. Patents, 76.15%.

2. Global Warming publications, with a of 16.85%

3. Total Chinese patent applications, at 25% per annum

4. Chinese computing patents, 58.29%

5. Chinese telecommunications patents, 30.26%

(was oncology)

7. Epidemiology patents, 22.76%

So far, that's it.

Now let's look at where we did not find anything like doubling every five years (remembering that we need a CAGR of 14.87% to double in five years time):

Update: We have now found other areas where human knowledge does not appear to be growing at a rate that will double in five years time:

Worldwide university enrollment, which has a CAGR of 4.85%

Alzheimer's disease, where academic publications have a CAGR of 9.65% and patents a CAGR of 11.26%

Programming, where academic publications have a CAGR of 12.55% and patents a CAGR of 12.83%

Chinese university enrollment, with a CAGR of 9.76%

1. U.S. patents granted, 5.21%

2. Epidemiology publications, 8.85%

3. Shale oil extraction, publications 5.59%, patents, 2.61%

4. Publications on Mars, 5.78% (see also here and here)

5. U.S. Book publishing, 5.14%

6. Indian university enrolment since 1947, 5.51%

7. Climate Models, 12.66% (Very close to 5-year doubling rate of 14.87%)

8. Improvements in Climate Models, 13.76% (Also very close)

9. Growth in publication of scientific and technical papers, 1995-1999

             OECD 0.1%

             U.S., -1.8%

             Canada, -2.2%

             UK, -0.1%

             Netherlands, -0.5%

10. Publications in astrophysics, 4% (since 1970)

11. Growth in percentage of cancer patients surviving 5 years after diagnosis, 0.7% (1960-2005)

12. Drop in percentage of soldiers dying from wounds received in battle, 1.54% (1941-2006)

13. Projected growth in number of scientists (U.S.), next ten years, 7.18%

14. Patent filings in India, last five years, 8.45%

15. Increase in patent filings, World International Patent Office, 1 year, 6.4%

6. Oncology pubications, 9.23% (revised)

Considering I cherry picked areas that I thought would be most likely to show robust growth, this is scary.

          

05/07/2007

Human Knowledge and Patents

It used to be that, when an academic found something new in her or his research, she or he would write a paper and submit it to an academic journal. And it used to be that when corporate research and development departments (or some geezer in a garage) made a discovery, they would rush to apply for a patent.

Now, academics are quite likely to file a patent, or the institution they work for might--or both, in combination. So it would not be surprising to see a growth in patent applications.

The United States Patent and Trademark Office has been issuing patents since 1790. As of March, 2006, the USPTO had issued more than 7 million patents.

And wonder of wonders, miracle of miracles, they have a statistics page that shows patent activity by year since 1790. Wow.

We'll have to juggle a bit here. The applications for patent stops at 2005 (390,733), but the patent grants time series goes to 2006 (173,771). It's kind of important which year to choose, because in 2006 patent grants returned to growth after 2 years of decline. Not coincidentally, the Patent Office hired 1,100 new patent examiners in 2006 to deal with a backlog of applications.

Going to 2006, patent grants have shown a CAGR (compound annual growth rate) of 5.21%, which would cause a doubling of patent grants every 13.65 years. Patent applications started being counted in 1840, and as I said, we don't have 2006 figures. For the 165 years available, patent applications have shown a CAGR of 3.88%, which would double applications every 18.21 years.

Well, perhaps the rate of growth has been higher recently. That's what the five-year comparison is supposed to show. We will focus on patent grants only, and statistics are in the extended post below.

There are only two five year periods where the number of patents granted is double that of the preceding five year period. After the Civil War (1867-1871), when patents climbed from 26,613 to 61,646, and between 1807 and 1811, when patents rocketed from 366 to 898.

There are many five year periods where the numbers of patents declined, the most recent being the period between 1977 and 1981.

The figures below look very much like the real world. As the population increased and the environment became more conducive to technological innovation, human knowledge grew--but not exponentially and not, indeed, in a straight line. Recent performance is good--but no better in percentage terms than many periods preceding it. The number of US patents granted has not doubled in the past 15 years, let alone the past 5. We'll take a look at other countries' patent statistics later--maybe they are making up for the U.S. shortfall.

Click if you want to see the numbers.

Continue reading "Human Knowledge and Patents" »

05/05/2007

Oncology

Entrez is the integrated, text-based search and retrieval system used at NCBI for the major databases, including PubMed, Nucleotide and Protein Sequences, Protein Structures, Complete Genomes, Taxonomy, and others. It is a service of the National Library of Medicine and the National Institutes of Health. However, it is included within Scirus, so the results will be taken from Scirus.

Searching with the term 'oncology' returns 148,053 results. 57,936 were published between 2002 and 2006 (inclusive). That's only 39% of the total, so it doesn't look like human knowledge of cancer has doubled in the past five years on that basis. In the preceding five years (1997-2001), only 38,153 results were published. That means that the most recent five year period did not even double the preceding five year period.

The CAGR for academic publications for oncology is 9.23% (since 1920), which would double every 7.85 years. Patent growth since the first filing in 1973 is 28.62%, which would double every 2.75 years. Using the same base date of 1920 would yield a CAGR of 10.02%, doubling every 7.26 years. I don't know which date distorts the picture less.

And the fact is, we don't know how many times we will have to double our knowledge of cancer before we will be able to make a big difference in the lives of those who have it. I have to say, I think I'd rather see the growth rate we note in nanotechnology here--and yes, I think I am saying I would make the trade off. I'm not advocating central planning of scientific research--I'm just wondering if research incentives are correctly allocated.

Now champions of nanotechnology wll say that nanotechnology in general has many direct applications to medicine in general, and further that some nanotechnology research is specifically concerned with oncology. And it's true. My reservation comes from my suspicion (for which I have zero evidence) that this is engineering led (hey--come here and see what my new toy can do!) rather than medicine-led (can you build something that will do x, y, and z in the following sequence and make sure it does not do a, b or c?) If my suspicion is correct, this wil prolong the search for solutions rather than shorten it.

Is there a truce in the war on cancer? Best put out that cigarette. Because we see that, although oncology did see a doubling of knowledge every five years up to 1991, that hyper-growth tailed off afterwards. Note that hypergrowth started before Nixon's war on cancer. Note that growth since hypergrowth stopped has been perfectly respectable--14,000 papers published in 2006 (more in 2006 than in the entire decade of the 80's), and 4,000 patents filed. And note especially that growth in patents has continued to double every five years. Make of it what you will.

The figures are below. Click to see the numbers.

Continue reading "Oncology " »

So Where Did This Crazy Idea Come From?

We saw below that President Clinton said that human knowledge doubled every five years in at least two different speeches. We saw below that a respected German professor in Artificial Intelligence said that the number of scientific publications doubled every five years. But neither of them cited a source. I doubt if President Clinton did the research, and I can't find anything published by the German professor that deals with this. So where did it come from?

On several searches I've seen reference to UNESCO as a source of this information. And indeed, at the 11th General Conference titled Universities--Gateway to the Future, in 2000 Mahdi Elmandjra said, "Richard Knight estimates that the total knowledge of mankind doubles every seven years and adds,"90% of the present knowledge of mankind has been produced in the course of the last thirty years. But if we define knowledge as the capacity to survive on earth in a sustainable manner then 90% of human knowledge has been lost during the last 30 years."

But Richard Knight (referenced in other papers of the conference) is a professor of architecture at the University of Genoa in Italy. I cannot find anything published by him--and the quote says seven years, not five.

Where does this idea come from?

Relevant Research from 1988

It would have been very strange if I were the only one asking this question and doing some research. I'm glad to find some prior work on this, not least because it means I'm not the only one who cares.

A report on the International Conference on Fifth Generation Computer Systems in 1988, written by Jakob Nielson, refers to research by Jorg Siekmann of the University of Kaiserslauten in the then West Germany.

Nielson writes, "Siekmann reported that the number of scientific journals doubles every 15 years, the number of books in university libraries doubles every 10 years, and the number of scientific publications doubles every 5 years. The knowledge explosion has already reached a level where in chemistry it is often cheaper to conduct a possible duplication of an experiment than to search the literature to find whether the result of a previous experiment can be used." That last bit is scary--I wonder if it's still true. Let's see if we can find more about Siekmann and his research.

Google likes him--a little too much. There are 108,000 search returns on his name.

The University has some material about him: Jörg H. Siekmann is the head of several research groups working in artificial intelligence at the "Universität des Saarlandes" and at the German Research Centre for Artificial Intelligence (DFKI) in Saarbrücken. He is a professor in the department of computer science and a director at the DFKI. The five research groups are:

Automated Reasoning for Mathematics: OMEGA 
Competence Center e-Learning
Formal Methods: VSE
Multi-Agent Systems
The IT-Security Evaluation Centre

Currently he is the chairman of the collaborative research centre "Resource-adaptive cognitive Processes" (SFB 378) funded by the German Science Foundation (DFG). He is chairman of the network of excellence on computational logic (CoLogNet), and vice president of the International Federation on Computational Logic (IFCoLog) . His main research interests are Artificial Intelligence, Automated Reasoning, Multiagent Systems and e-Learning in Mathematics.

But I can't find any more information about his statement at the conference. However, I did stumble across something quite interesting that might lead to the origin of the statement  Human Knowledge Doubles Every Five Years. See next post (above).

How Important Is All This?

Now that I've been doing this for a month, the question of how important this all is is becoming more relevant. When you're in the middle of the hard slog of gathering data, it's normal to get frustrated. It's also normal to get a little grandiose in your thinking about the worth of what you're doing, partly to motivate yourself to keep going.

So how important is all this? If I come up with the finding that human knowledge doubles every ten years instead of every five, who will care? What will it change?

I mentioned a few things in an earlier post that I am starting to think are actually relevant questions for society. If we are convinced that human knowledge doubles every five years, then we don't need to quit smoking, as cancer will be cured before it strikes us. Similarly, we don't need to worry about pollution, because we'll sort it out before the effects can truly harm society. You can adopt this lazy attitude about any of the problems our world confronts.

But it would be a mortal pity if human knowledge doubles every six or ten or twelve years and you die of cancer the year before the cure is discovered instead of the year after.

The idea that human knowledge doubles every five years is widespread. A Google search using the term human knowledge doubles every five years returns 1,680,000 results (obviously only a fraction are relevant).

If that information is wrong, and individuals are making important decisions and organisations are misallocating resources in this mistaken belief, then yes, this research is important and worth doing. Gee, I feel reassured.

05/04/2007

One Month Progress Report

Well, this weblog has now been up for 30 days. I had actually intended to do all the basic reseach in this time frame, but life has somehow interfered. A major reorganisation at work has slowed me down.

Nonetheless, we've managed to explore the growth in human knowledge in a number of areas, so let's recapitulate:

  1. We do find a doubling of the evidence of human knowledge (journal publications and patents) in some sectors, such as nanotechnology and global warming, new fields that double from a low base. I'm tempted to label this the Manhattan Project effect, but I'm not going to do that until I study that sector since 1945.
  2. We do not see a doubling in areas with a longer time series, such as astrophysics or the study of Mars, although there are periods of impressive growth in each.
  3. In some areas, we see different trends for patents as opposed to journal publications. For example, in epidemiology we see an 8.85% CAGR in journal publications in the recent past, while patents have grown by 22.76% annually since 1987.
  4. We do see evidence that human knowledge is available to twice as many people, and that has grown at a rate that more than doubles every five years. (Just looking at the growth of Wikipedia and the numbers of people connected to the Internet makes that fairly obvious).
  5. We do not see evidence that the number of scientists and researchers doubles every five years, and it is not at all obvious that emerging country growth is making up for a slowdown or actual decline in OECD numbers.

But there's a lot more work to do, and any of those five points could change. I'm having fun doing this, so we'll just carry on.

05/02/2007

Epidemiology

Well, Scirus, the database of Scientific knowledge I've been using frequently on this weblog, has a lot of material relating to epidemiology. I feel a bit vulnerable relying on this source so heavily--if anybody has suggestions for freely available information, I will happily (well, grumblingly and slowly and swearing occasionally) replicate these searches on it.

Searching for epidemiology on Scirus returns 1,500,983 results. 315,678 of these are journal articles. 8,530 are patent results and the other million plus are 'other' web results. So, focusing on journal articles and patent results since 1920, let's see if the numbers double every 5 years.

Human knowledge (as expressed by journal articles published) about epidemiology has not doubled in the past five years. Only 106,722 of the 315,678 journal articles were published between 2001 and 2006. It's truly a triumph of the human mind, but it is not doubling. It is a compound annual growth rate (CAGR) of 8.85%, which would result in a doubling every 9 years. Looking at the time series, we see dips and lulls and a steadily progressive trend line, with a couple of jumps. When I get the graphs prepared, it will be easier to see.

Overall, CAGR for epidemiology is 6.22%, which doubles every 11.49 years. Overall growth in patents is 17.37%, which doubles every 4.33 years.

However, backing out the 2007 patents (198) leaves a total of 8,332. Of that number, 5,444 (65%) were filed between 2001 and 2006. So while academic investigation did not double in the past five years, commercial research and development more than did so. Indeed, epidemiology patents appear to have more than doubled in every five year period since 1987, when 19 patents were filed, a CAGR of 22.76%.

I suppose, if we treated one journal article and one patent as exactly equal, we could add them up and compare the totals. But, as I certainly don't have any reason to think that might be the case, I'm not going to. The figures are below if you want to do it, though.

Click below to see the data.

Continue reading "Epidemiology" »

How Would We Define Victory?

Well, time to define a few things here. We are trying to discover if human knowledge doubles every 5 years. To do this, we are testing three hypotheses:

  1. We learn so much new information that the actual stock of knowledge is double what it was 5 years ago. We are testing this by seeing if scientific publications and patents are doubling every 5 years. We also look and see if there are twice as many scientists and researchers as 5 years ago.
  2. That new and old human knowledge is accessible and usable to twice as many people as it was 5 years ago. We're looking at things like the size and reach of websites like Wikipedia, and also referring to our search for more creators and users of information.
  3. We are disproving incorrect hypotheses at such a rate that we are doubling human knowledge by unlearning untruths, which corrects mistakes and frees up time for productive learning.

But this is a bit simplistic.  If I know one thing and learn another, have I doubled human knowledge or just what I have learned? Is a learned fact human knowledge before it is published?

If I know one thing and, without learning another, communicate that one thing to another human being, does that double human knowledge because two of us now know it? Or is that just repetition?

If I discover an hypothesis is not correct, does it make any difference if others continue to hold the old and incorrect belief?

And then we must make things difficult by defining knowledge. Many of you will have seen the pyramid that has data at the bottom, information on top of it, then knowledge and then wisdom. The implication is that data is a fact revealed, information is the fact in context, knowledge is is the fact absorbed and assimilated, and wisdom is right action based on knowledge as experienced. But is that an accurate or adequate definition of knowledge? Is it sufficient?

Wikipedia has a definition--it's a bit long, but the start is promising. Of course the start is a quote from Oxford's English Dictionary. "Knowledge is defined (Oxford English Dictionary) variously as (i) facts, information, and skills acquired by a person through experience or education; the theoretical or practical understanding of a subject, (ii) what is known in a particular field or in total; facts and information or (iii) awareness or familiarity gained by experience of a fact or situation." Yeah, that's it.

So does learning that the Earth is more than 6,000 years old mean anything if millions don't believe it? If creationists make learned scientists spend time in court patiently explaining evolution instead of working in the laboratory--their true 'field of the Lord,' have we learned anything? Does knowledge have to be unanimously held, or unanimously available?

Boy, where's a philosopher when you need one?

04/30/2007

The Pot of (Black) Gold

Oil Shale Extraction could bring access to 1 trillion barrels of oil in the United States. Canada has even more. If some biblical figure could strike his staff against the stone and bring forth oil, he (or she) would be a very rich biblical figure, indeed.

But it's a bit tougher than that. An interesting article about the state of oil today notes that "companies developing tar sand production in Alberta announced that their costs of production were rising substantially."

Sounds like a job for science and technology... especially as biblical figures are a bit short on the ground these days. With rumors of oil production about to decline and continued resistance to nuclear power, someone who can bring this large scale energy source to our gas tanks will be heavily rewarded.

So what is happening in this subsector of a subsector? There are academic institutions that focus on engineering and others that focus on energy. There are patents to be filed, alliances to be made, deals to be done and advances to be discovered. Again, like nanotechnology and global warming, this is one area where we should expect to find a doubling of human knowledge every five years--at least.

Better yet, from our point of view, we've been extracting oil from shale since 1847 (those canny Scots at it again), so there should be a time series of publications for us to study--almost as long as our study of studying Mars.

We did find one, at Scirus again. It goes back to 1920, but only gives results for the decade up til 1950.

Taking an average of 4 journal articles for 1920 and 115 patents for the same year, we get CAGR (Compound Annual Growth Rate) of 5.59% for journal articles, and 2.61% for patents. As shown below, you can easily tell when oil prices rose.

But comparing 2002-2006 with the five years preceding, we can quite clearly see the effects of commerce on knowledge seeking. Journal articles during this period rose by 13.94% for the entire 5-year period. But patents--they rose by 99.19%, doubling within 5 years.

We see similar effects when measuring output in the 70's. Journal articles grew more strongly--at 245.86% for the five year periods. Patents grew strongly, but only rose from 1,098 to 1,730 or 57.56%. In fact, it makes me wonder if the oil shocks of the 70's drove academic interest immediately, which led to a knowledge base being available for patent seekers when prices rose again after 2001.

The story of oil shale and human knowledge looks much more like punctuated equilibrium than the steady advance of human knowledge, and what punctures the equilibrium is economics.

Results below...

Continue reading "The Pot of (Black) Gold" »

04/28/2007

Back to Global Warming

And back to Scirus, the compendium of scientific publication. Okay, it's a search engine returning titles of journals, patents, etc.

What are we to make of the fact that searching for global warming on Scirus returns 100,000 more results than a similar search for nanotechnology? It's been studied for 20 years longer than nanotechnology, so global warming should reflect that... But there are 8,000 fewer academic articles in the database (28,000 for nanotech vs. 20,584 for global warming). What does that mean?

We found 5,184 patents on the Scirus database for nanotechnology. For global warming... 9,419. Scientists and technologists have been studying global warming for more than twice as long, so that may mean nothing. But, let's look at the statistics and see what they tell us.

As shown below, playing by the same rules as we did for nanotechnology and the study of Mars, we hereby proclaim a base year of 1975, as it saw 18 journal articles published and one patent filed. That gives us 31 years to measure growth. This indeed gives us a CAGR (Compound Annual Growth Rate) of 16.85%, which in fact does show doubling every five years! Hooray, we have another winner!

The CAGR for patents is higher yet--26.08%. (But again, it seems to have leveled off in 2002.) We saw below that growth was disappointing in the subsector of climate modelling, but it's being more than made up for elsewhere.

But, given the potential gravity of the consequences of global warming, am I the only one to look at the growth rates of human knowledge in this field, compare them to the growth rates for nanotechnology, and feel slightly disappointed?

The data is below. Click if you want to see the numbers.

Continue reading "Back to Global Warming" »

But This Isn't Just About The Singularity, Dammit!

The Singularitarians need an exponential growth of human knowledge to enable their theory of short term artificial intelligence.

But I submit they are not the only ones who are changing their behaviour and expectations because of this general idea floating around. The idea that human knowledge doubles every five years can become an excuse for inaction.

If people think that human knowledge doubles every five years, they may think that improved science and technology will provide a 'deus ex machina' last minute cavalry charge over the hill solution to something, meaning they don't have to change their current behaviour.

They might continue smoking, thinking that surely, by the time they're 69 (the average age of diagnosis for cancer), we will have found a silver-bullet solution for all cancers.

Their organisations don't modify damaging behaviour, thinking that we will find a way to turn pollution into ice cream and champagne, or replace depleted fisheries with genetically modified de-boned cod that jump into our nets with a smile.

I think this attitude, to the extent that it exists, is quite dangerous. If my first pass through publicly available data suggests anything (and even when I finish it will need to be approached academically), it suggests that human knowledge does not double every five years.

What we do see is that specific sectors, such as nanotechnology or proteomics, grow even faster than that for short periods of time:

  • When the sector is new
  • When the sector is sexy
  • When government or private industry sees a short term gain resulting from it
  • When intermediate goals are visible and attainable

But exponential growth then reverts to a mean, or returns to an average growth that is far slower. That growth rate varies according to the availability of inputs, such as researchers, funding and above all the availability of findings from outside the given sector--the multi-disciplinary rescue. This has rescued Moore's Law more than once, and it is applicable across fields.

Not every problem deserves the resources of a Manhattan Project, which is the prime example of how throwing money and scientists at an issue can produce results. Some problems can be solved without a Manhattan Project. But I fear that problems like lung cancer and artificial intelligence do need resources at that scale...

...and I fear even more that a general, sloppy impression that human knowledge doubles every five years without any real effort or a need to get behind the knowledge machine and push is actually harming the cause of scientific and technological progress. It relieves us of personal responsibility. Oh, the new horde of Chinese researchers will solve it, so we don't have to.

That is pernicious.

04/25/2007

What Is The Singularity?

Okay. Here is the definition provided by the Singularity Institute for Artificial Intelligence (SIAI):

"The Singularity is the technological creation of smarter-than-human intelligence. There are several technologies that are often mentioned as heading in this direction. The most commonly mentioned is probably Artificial Intelligence, but there are others: direct brain-computer interfaces, biological augmentation of the brain, genetic engineering, ultra-high-resolution scans of the brain followed by computer emulation. Some of these technologies seem likely to arrive much earlier than the others, but there are nonetheless several independent technologies all heading in the direction of the Singularity – several different technologies which, if they reached a threshold level of sophistication, would enable the creation of smarter-than-human intelligence."

The Institute is described on Wikipedia here. I emailed the founder, Eliezer Yudkowsky, to get his thoughts on what I am doing here. He hasn't responded as yet.

For 'Singularitarians', the Singularity is a big transformative concept, one of those spectacular theories that will change human life as we know it. When we create intelligence greater than our own, we will solve problems that we cannot solve today. Furthermore, many of the technologies (and especially nanotechnology) that will be used to create artificial intelligence will be used by the artificial intelligence to help us with healthcare, energy (PDF), space travel and more. God help us if we create an artificial intelligence that wants to sit on the couch and watch the soaps.

Those who fear artificial intelligence, nanotechnology (and perhaps big transformativc concepts), manage to create a vision of some rough beast slouching towards Bethlehem waiting to be born. From fears of grey goo (nanotechnology run wild) to the destruction of human ambition, AI is a threat--perhaps the biggest outsourcing threat of all, especially if it is embodied in robots.

04/22/2007

Nanotechnology and Human Knowledge

There's a search engine called Scirus for scientific information. A search performed on Sunday, 22 April 2007 with the search term 'nanotechnology' returned 637,500 results. Fortunately, we can winnow that down to 28,846 journal results and sort them by date. (We also found reference to 5,148 patents) So, here we go again... 23,489 of those journal articles were published after 2002 ended. That's 81% of the total. Similarly, 4,494 patents were filed since the end of 2002, 87% of the total.

Using 1987 as a base year, we find CAGR in journal publications of 42.03%, which would double in 1.98 years. For patents, we see a CAGR of 44.91%, which would double every 1.87 years.

And this is why Ray Kurzweil and the rest of the Singularity gang get excited. Remember we need annual growth of 14.87% to double in five years. Nanotechnology has got it in spades.

Last year's growth was 37.87%. Maybe they will take over the world... but I will explain later why I think growth in nanotechnology is a necessary, but not sufficent pre-requisite for the Singularity.

Note that the advanced search by year  yielded 28,863 journal results (but it found all 5,148 patents )(one reason these types of databases drive academic researchers crazy--I hear Google Scholar is worse, and I'm afraid I'm going to find out soon...)

Click if you want to see the numbers

Continue reading "Nanotechnology and Human Knowledge" »

Chinese Papers in Nanotechnology

Well, I was going to wait before starting on nanotechnology, but I don't have a convenient place to park this information, so you get it in toto:

From PhyiscsToday.org: "A paper by Ping Zhou and Loet Leydesdorff (PDF) suggests that China is rapidly developing research expertise in nanotechnology. Six years ago, US-based scientists published 50% of the papers in the journal Nanotechnology and China accounted for zero. Today 25% of papers in the journal are from US-based scientists and 15% from China. Zhou and Leydesdorff report that between 1999 and 2005, for 85 journals in the fields of chemistry, physics and material sciences that are relevant to nanotechnology, the number of papers published by China rose exponentially to 8.34%, while papers published by other countries either remained steady or declined. China's overall world share in science journals is 6.52%, suggesting that China is aiming to be at the forefront of research into nanotechnology."

The paper this article is based on talks about a ratio between GDP and R&D, the return of overseas scholars and other factors. The paper then details research that looks like what we're trying to do here--except they're doing it a lot more... scientifically. They even call it scientometric analysis! I'm going to steal that...

Where I am using crude statistics available for free and for fun, they use "indicators like total publications, total citation rates, percentage of world share of citations, as well as the top one percent of most highly cited papers in order to measure scientific output."

Something else I'm going to have to look at--"For the input indicators, we used the OECD's Main Science and Technology Studies published online and in print (OECD 2004)." Like me, they experienced difficulty in searching large databases of publications. They found their own solution, but it won't help me going forward.

Some interesting findings:

  • Between 1999 and 2004, China moved from 10th to 5th place in terms of overall scientific publications
  • The overall US share has dropped from 35% to 30% between 1992 and 2004 (but if the overall number is rising, who cares?)
  • Countries where English is commonly spoken (and written, obviously) appear to have an advantage when it comes to having your paper cited by others
  • More than 4,400 science and technology journals were published in China in 2001
  • Half a million articles are published annually in Chinese journals
  • It appears that Chinese scientists read the work of scientists in other countries, but that international scientists don't read (or at least don't cite) work published in Chinese journals (with the exception of some Chinese journals that are published in English)
  • The Chinese government declared nanotechnology a critical R&D priority in 2001, and a group of commissions and ministries issued a strategic plan for development of nanotechnology through to 2010
  • Definitions are tough. It was hard for the researchers to find all nano-related papers, and only nano-related papers. (I know what that's like--and what it means. You actually have to read, instead of count, and that takes time)
  • The paper's authors use really, really cool graphs to show their work. I am extremely jealous

Now I just have to find out where they got those numbers at the end of the news release...  they could have listed those 85 journals...

Human Knowledge and Mars

(This part is repeated from the previous post): NASA and the Smithsonian Institute maintain a database called the Astrophysics Data System. Harvard University is kind enough to host it here. Available to you and I is an abstracting service which will return brief descriptions of articles found by keyword searches. You see where I'm going here?

For a description of coverage and limitations of the database, click here. Then, let's have some fun.

A search for the word Mars in titles and abstracts performed on April 21 2007 returned 20,978 results.

(This is where the new stuff starts): Here we report on 5-year periods going back to 1900. We show annual percentage growth on the previous 5-year total. It isn't pretty--much more like real life than any smooth logarithmic curve. In 22 five-year periods, we show an annual percentage growth above 14% (shown in bold) in only 3 five-year periods(you need a 14.87% annual growth to double every five years). In 7 of the five-year periods, there is a decline in publications (inefficient post, wars, depressions, Mars being further away... lots of logical reasons why this might have occurred).

The serious question is, if our studies of Mars, which have benefited from major government funding from several large governments, and have also benefited from being topical, exciting, and having large sources of data available recently, do not show exponential growth, where would we expect to actually find a doubling of human knowledge every five years? If not here, where?

What I think it shows me is that when there is a lot of funding, real news, a research goal, and the availability of new and important data, we focus on the subject, do the research and improve knowledge. But for extended periods, things just sort of tick along, improving respectably in many years, but showing declines in others. What do you make of the following?

Click if you want to see the numbers.

Continue reading "Human Knowledge and Mars" »

Human Knowledge and Mars

NASA and the Smithsonian Institute maintain a database called the Astrophysics Data System. Harvard University is kind enough to host it here. Available to you and I is an abstracting service which will return brief descriptions of articles found by keyword searches. You see where I'm going here?

For a description of coverage and limitations of the database, click here. Then, let's have some fun.

A search for the word Mars in titles and abstracts performed on April 21 2007 returned 20,978 results. Let's pretend for the sake of argument that this represents all human knowledge ever gained about Mars. (This may be as close to the truth as we will ever get on this weblog when searching for information about fields of study--but it is obviously not true, so take it with a grain of salt.)

What is the CAGR (Compound Annual Growth Rate) in publications? Well, if we were really strict, we would measure from the date of the first document, which is... 1644. That would give us a CAGR of 2.32% and we would all get very depressed. But a more realistic starting point would be 1830, when regular publications started. This would remove only 10 documents from our total, and would yield a CAGR of 5.78%. That's much better, but far short of doubling every five years. (At that rate, human knowledge about Mars would double every 12.34 years.) Nonetheless, half of the documents in the database have been published since 1993. (Hey--that doubled in 14 years... is the rate of doubling slowing down?)

Just the metadata I got from the site is fascinating, and I'll be referring to it later, looking for short term bursts and lags (and wondering if Mars' distance from the Earth is one of the causes, for example). But here's some interesting information.

The most recent, published in April 2007, is titled "Subsurface Radar Sounding of the South Polar Layered Deposits of Mars and written by Plaut, Jeffrey J.; Picardi, Giovanni; Safaeinili, Ali; Ivanov, Anton B.; Milkovich, Sarah M.; Cicchetti, Andrea; Kofman, Wlodek; Mouginot, Jérémie; Farrell, William M.; Phillips, Roger J.; and 14 coauthors

Document #3001 is titled "Review of the trajectory and atmospheric structure reconstruction for Mars Pathfinder", written by Withers, Paul; Towner, Martin; Hathi, Brijen; Zarnecki, John

Document #6001 is titled "History and perspectives of scientific ballooning," by Frank Lefevre. Hmm. Well, document #6002 is "Stability and evolution of the climate system of Mars," written in 2001 by Stability and evolution of the climate system of Mars

Document #9001 is titled "Mobilization of dust on the Mars surface by the impact of small cosmic bodies," written in 1997 by Rybakov, V. A.; Nemtchinov, I. V.; Shuvalov, V. V.; Artemiev, V. I.; Medveduk, S. A.

Document #12001 is titled "Dry Carbonate Formation on Mars: A Plausible Sink for an Early Dense CO2 Atmosphere?", written in 1990 by Stephens, S. K.; Stevenson, D. J.

Document #15001 is titled "Stratigraphic relationships within Martian polar CAP deposits" written in 1982 by Howard, A. D.; Cutts, J. A.; Blasius, K. R.

Document #18001 is titled "The major Martian dust storms of 1971 and 1973" written in 1974 by Martin, L. J.

The very first document, written in 1644 by Grandami, Jacques, is titled "Nova demonstratio immobilitatis terrae petita EX virtvte magnetica."

If that's a little dense for you, the second, written in 1672 by Flamstead, John, is "An Extract of Mr. Flamstead's Letter Written to the Publisher from Derby Novemb. 16. 1672. Concerning the Appulses, by Him Calculated for A. 1673. of the Moon, and the Other Planets to Fixed Stars; Together with an Observation of the Planet Mars, Made by the Same"

04/21/2007

Human Knowledge and Mars

I off-handedly chose Mars as a subject for investigation, basically because there have been several missions to the Red Planet in the past 5 years. Those missions were expensive, they were designed to gather knowledge about Mars... so if human knowledge is doubling every 5 years in any given sector, we might imagine that one of the sectors is Mars. Well, I actually have an ulterior motive, but that's for the end of this post.

The Exploration of Mars is beautifully detailed on Wikipedia. The first probe to actually get close to Mars was in 1962, the Soviet mission Beta Nu 1. It was followed by the US Mariner 4 in 1964. So we have been collecting data from space missions for 45 years. The 1996 Mars Global Surveyor effectively returned more data than all previous missions combined, and the data is available to the public.

The Mars Pathfinder landed on Mars on July 4, 1997, and a remote controlled rover called Sojourner decamped from the lander and went walkabout. Pathfinder returned 16,500 images from the lander and 550 images from the rover, as well as 15 chemical analyses of rock and soil and extensive data on winds and weather. This was followed by a spate of unsuccessful attempts to return to Mars.

In 2001 the run of bad luck ended when NASA's Mars Odyssey orbiter arrived. Its mission is to use spectrometers and imagers to hunt for evidence of past or present water and volcanic activity on Mars. In 2002, it was announced that the probe's gamma ray spectrometer and neutron spectrometer had detected large amounts of hydrogen, indicating that there are vast deposits of water ice in the upper three meters of Mars' soil within 60° latitude of the south pole.

NASA sent a pair of twin rovers toward the planet as part of the Mars Exploration Rover Mission. On 10 June 2003, NASA's MER-A (Spirit) Mars Exploration Rover was launched. It successfully landed in Gusev Crater (believed once to have been a crater lake) on 3 January 2004. It examined rock and soil for evidence of the area's history of water. On July 7, 2003, a second rover, MER-B (Opportunity) was launched. It landed on 24 January 2004 in Meridiani Planum (where there are large deposits of hematite, indicating the presence of past water) to carry out similar geological work.

Mars Reconnaissance Orbiter is a multipurpose spacecraft designed to conduct reconnaissance and exploration of Mars from orbit. The $720 million USD spacecraft was built by Lockheed Martin under the supervision of the Jet Propulsion Laboratory, launched August 12, 2005, and attained Martian orbit on March 10, 2006.

The MRO contains a host of scientific instruments such as the HiRISE camera, CRISM, and SHARAD. The HiRISE camera is used to analyze Martian landforms, whereas CRISM and SHARAD can detect water, ice, and minerals on and below the surface. (In case you haven't guessed, I'm just copying from Wikipedia here.)

Now, the Wikipedia article on Mars itself is quite long. It contains very little material that could have been written before the past 45 years--the planet was mapped in 1840, some basic astronomical information has been long known, but almost all of what Wikipedia thinks is important is recent, and most of it really, really recent. Of the 79 citations at the bottom of the article, 30 were written after 2004.

So I'm ready to admit that human knowledge about Mars has doubled every 5 years since 1962. I'll be searching for more evidence to back this up. But here's my ulterior motive: I think that new sectors, or sectors that have big money thrown at it, or sectors where there is a clear public good to be obtained from applied research, might well indeed experience exponential increases in knowledge. But are they sustained? Does knowledge gained transfer readily to similar sectors? (Do we, for example, know as much about Venus or Mercury?) If new knowledge gained in the past 5 years about Mars (and I include getting to Mars, data collection design, etc.) leads to similar increases in knowledge about other planets in our solar system, then I think we're on to something here.

Wow!

I was about to click out of the Lawrence Livermore Laboratory's website when I stumbled across this news release: "Lawrence Livermore National Laboratory’s institutional “Grand Challenge” scientific computing program has allocated 83.7 million CPU hours to 17 research projects ranging from astrophysics, chemistry, materials and biosciences to earthquake and climate simulation on Laboratory supercomputers."

... "Research projects were selected by the Laboratory’s Deputy Director for Science and Technology, the Laboratory Strategic Program Board and the Laboratory Science and Technology Office. To be considered, proposals had to “address a grand-challenge-scale, mission-related problem that promises unprecedented discoveries in a particular scientific and/or engineering field of research, and if successful, will result in high-level recognition by the scientific community at large.”

... "For example, a multi-institutional group of climate scientists will conduct the most detailed multi-decade global climate simulations ever performed as part of the larger effort to understand how climate changes over time. “By performing these climate simulations of unprecedented detail and fidelity and making them available to the climate research community, we have the potential to dramatically advance our understanding of climate and how it may be affected by anthropogenic factors,”

Grand Challenge recipients have been allocated time on Atlas, the new 44 teraFLOP (trillion floating operations per second) machine and Thunder, a 22 teraFLOP machine -- systems dedicated to unclassified research through the Laboratory’s Multi-programmatic and Institutional Computing program. Central processing unit or CPU time is measured across the multiple CPUs in a computer. For example, two CPU hours can be one CPU used for two hours or two CPUs used for one hour. High performance computers generally consist of thousands of CPUs; the Atlas system contains 9, 216 CPUs."

You know, it's stories like this that make me optimistic about the future.  "Over the last 10 years, high performance computing resources dedicated to unclassified institutional research have increased more than a thousand fold from 72 giga (72,000 million) FLOPS in 1997 to 81 teraFLOPS today."

This is what the Singularity protagonists are pointing to. Prosaic sad sacks like me are stubbornly waiting to see the results of the research performed. Something inside me grumbles that a well-designed program wouldn't need 82 million hours to execute... (this from someone who's never written a line of code in his life).

On the other hand, there are specific sectors where this exercise in and of itself might result in the doubling of human knowledge.

Global Warming and Hypothesis 3

If you've read the beginning days of this weblog (which today celebrates two weeks of actual existence), you might recall that I am trying to test three hypotheses. The third is that we are disproving (or falsifying) incorrect assumptions, hypotheses and opinions at a rate that effectively doubles human knowledge (knowing something is not true is really, really important). This is just as important in global warming as with anything else.

Critics of global warming have for years pointed to the fact that satellite observations, taken from a single dataset, showed slight cooling in the tropical troposphere since 1979. Since this conflicted with what climate models predict, this was a serious problem for those attempting to warn the public about global warming.

So they went and got more data, which is what you're supposed to do. This 2005 news release from the Lawrence Livermore Lab (close to where I grew up), shows that further research found better data that agrees with modern climate models.

As there is still considerable controversy about global warming, there's a whole lot more falsifying and disproving that needs to be done, probably on both sides of the fence. If you think about it, the  beneficiaries of this disproof of the previous data are the critics of global warming as well as those who strongly believe, as critics now actually have to examine their assumptions, look for more or better data, or find explanations for what this research showed.

And Still More About Human Knowledge and Global Warming

If you go to climateprediction.net you can volunteer your computer's spare time to help run climate models. I found the site thanks to an article at NewScientist.com. The project, started in 2002, appears to have run the Hadley model 172,229 times (at the time I checked), apparently with minor changes in assumptions to check on the different outputs that result. Check here for the aggregated results of the first 27,000 runs. (I think--I didn't spend a whole lot of time on the site.)

This is exactly what I would expect to find if human knowledge is advancing rapidly and incorporating multi-disciplinary tools and knowledge to forge ahead. Hope I find a lot more like this.

Yet More on Human Knowledge and Global Warming

Okay, stick with me here. The IPCC summary for policy makers (PDF) of their draft report on Global Warming was issued in February--the draft will be released in May. It used data from over a dozen climate models in preparing their reports. The 2001 report relied heavily on a model developed by the UK's very own Hadley Centre. The policy summary notes the following problems with the climate models they used:

  • systematic biases in simulation of the Southern Ocean, which is important for the transfer of heat between the atmosphere and the oceans
  • on-going problems in simulating the El Nino - Southern Oscillation (ENSO) cycle, which is a major factor in the Earth's climates
  • poor simulation of precipitation events: "in general, models tend to produce too many days of weak precipitation (<10 mm per day) and too little precipitation overall  in intense events (>10 mm per day)"
  • substantial uncertainty in the simulation of feedback from sea-ice, which are coupled with polar cloud formation and transpor of heat through the polar oceans

Given the potential gravity of worst-case scenarios of the effects of global warming, you cannot wait for perfect data before attempting to make policy decisions. But these are not trivial issues.

This should be a hotbed of scientific activity, and we should see lots of research being done on the issues above. In particular, we should see attempts to harness Moore's Law, which baldly states that computer power has doubled every 18 months to 2 years in the recent past to make climate models more detailed, easier to test and quicker to run. So what do we find?

Here are some positive indications:

  • (I thought this was much more exciting until I looked at the publication date of 2002) (From Science Daily) "Atmospheric scientists from Lawrence Livermore National Laboratory have performed the first global climate simulations with spatial resolutions of roughly 50 km (30 miles). This capability will be used to assess climate change and its societal impacts. Typical global climate simulations use spatial resolutions of about 300 kilometers (186 miles), which limits their ability to simulate climate and climate change on a regional scale "
  • This (also from Science Daily, but from 2007) is the type of article I was hoping to find: Open Source Software Toolkit Plays Key Role In New Climate Simulations  Science Daily — The Model Coupling Toolkit created by the U.S. Department of Energy's Argonne National Laboratory played a key role in the climate simulations used in preparing the new U.N. report "Climate Change 2007: The Physical Science Basis." The Model Coupling Toolkit (MCT) is an open source software library for constructing parallel coupled models from individual parallel models. MCT is designed for high performance and portability. All of the simulations of the Community Climate System Model used the Model Coupling Toolkit." This is good--although the skeptic in me wonders if it means that any errors in the Model Coupling Toolkit are therefore replicated in all models that use it...
  • The University of Arizona researcher Joellen Russell, working with a group of researchers from National Oceanographic and Atmospheric Administration/Geophysical Fluid Dynamics Laboratory, Princeton, New Jersey, published (in December 2006) a paper called "The Southern Hemisphere Westerlies in a Warming World: Propping open the Door to the Deep Ocean,” which may specifically address some of the IPCC study's self-identified areas of weakness. Although I'm just interested in the fact that the research took place within the past 5 years, the results (that the Southern oceans can absorb more carbon dioxide and additional heat, perhaps reducing the rate of global warming), are not in strict accordance with what the IPCC is publishing.

I'll do a sequel to this post very soon. I don't like blog posts that go on forever, and besides, breakfast is waiting.

04/20/2007

Authorgeddon and Book Publishing

Well, I guess we need to hope that the Singularity happens before 2052. That's the year when Authorgeddon is predicted to happen, according to Wikipedia. In case you're wondering, Authorgeddon is the moment in time when the number of books published in the U.S. is expected to surpass the number of people in the U.S. who are willing to admit having read a book in the previous year.

"A study by print on demand company Lulu.com pinpoints 2052 as the year when Authorgeddon will arrive. The study arose from statistics published by R.R. Bowker, the company that controls ISBN distribution in the U.S., which reported that the number of books published in America in 2004 hit a record 195,000 -- a 14% increase over the previous high of nearly 175,000 recorded the year earlier. Bowker reports that the average annual rise in published titles over the last three years has been 14.6%."

That 14% increase looks pretty interesting--if it held up it would mean that the number of books published in the U.S. would double every 5 years. So, how does that stack up to real publishing statistics?

From Bookwire (owned by RR Bowker):

1993: 104,124

1995:  113,589

1999:  119,357

2000:  122,108

2001:  141,703

2002:  147,120

2003:  171,061

2004:  190,078

Sadly, in 2007 the number of books published in the U.S. plummeted to 172,000 new titles (I'll bet it's that damn Internet thingie). The overall CAGR is 3.65%, which would double every 19.33 years. But even looking at statistics to 2004 only (2005 might have been a blip, after all) it gives us CAGR of 5.14%. So even by cherry-picking the time frame to include the rosiest picture of growth, we get about 1/3rd of the growth we need. The concept of human knowledge doubling every five years did not originate with US book publishing statistics, I'm afraid.

04/19/2007

Back to Work

I'll be writing a lot more about the Singularity later. Let's go back to the basic building blocks that will educate my later writing.

We return to the numbers of people in higher education. If this number is growing exponentially, there might be reason to think human knowledge will too, in future. Specifically, in India, we find "India's provision for higher education rose from a meagre 200,000 at the time of Independence to an astronomical 5,000,000 college and university places by 1998." Those words come from Gajaraj Dhanarajan, President and CEO, The Commonwealth of Learning, Vancouver, Canada. He was speaking in August of 2000 at the 11th General Conference--Universities as the Gateway to the Future.

Let's see--India achieved independence in 1947 (right? Might be off a year--Nope. Google says August 15, 1947). If Indian university students had doubled in number every five years since then, starting from a base of 200,000, they should now have 516 million university students. So, if they have started growing exponentially, it is recently. (How many university students should India have? Using figures from Dhanarajan's speech, to have the same rate of university enrollment as Canada {5%}, for example, India should have had 50 million university students in 2000. Room for improvement here).

This is actually fairly important. The Compound Annual Growth Rate (CAGR) for the Indian university population is 5.51%. The CAGR needed to double in five years is 14.87%. A CAGR of 5.51% means the population doubles every 13 years, instead of five. We won't get where we need to be at this rate.

If you've been reading from the bottom up, you'll be able to join in the refrain--"Great growth. Not doubling."

One... Singular Dilemma

Singularitarians say that computing will advance to the stage where artificial intelligence outstrips human capabilities and then will happily begin designing even better artificial intelligence. I hope they're right.

But hand on heart, I honestly believe that if we had the magic box (or network of boxes) that could compute as quickly and efficiently as it will in 2035 (one of the dates proposed for arriving at the Singularity), we couldn't make it work. We couldn't program it to do what we expect AI to do at the time of the Singularity.

Human knowledge will have to increase in order to make the box work. I suspect (and am searching for evidence) that it will have to increase exponentially. (The alternative is finding a genius, which could happen at any time, but we can't count on it. That genius may exist, but have died yesterday in Darfur or Baghdad.)

If human knowledge is not increasing exponentially (or if we don't find and train the right genius), I don't think the Singularity can happen. Ever.

Time to Get Serious

If you type the search string "human knowledge doubles every five years" into Google today, it will return about 1,580,000 results.

The first result is from a Bellagio newsletter reporting on a conference, and it uses the sentence as if it were a given. The second is from President Clinton's 1998 speech to the AAS, where he asserts it as a given. The first post in this weblog quotes from it.

The third result is why I started this weblog. Here is part of the opening from The Low Beyond, by Eliezer Yudkowsky.

1: The End of History

It began three and a half billion years ago in a pool of muck, when a molecule made a copy of itself and so became the ultimate ancestor of all earthly life.

It began four million years ago, when brain volumes began climbing rapidly in the hominid line.

Fifty thousand years ago with the rise of Homo sapiens sapiens.
Ten thousand years ago with the invention of civilization.
Five hundred years ago with the invention of the printing press.
Fifty years ago with the invention of the computer.

In less than thirty years, it will end.

At some point in the near future, someone will come up with a method of increasing the maximum intelligence on the planet - either coding a true Artificial Intelligence or enhancing human intelligence.  An enhanced human would be better at thinking up ways of enhancing humans; would have an "increased capacity for invention".  What would this increased ability be directed at?  Creating the next generation of enhanced humans.

And what would those doubly enhanced minds do?  Research methods on triply enhanced humans, or build AI minds operating at computer speeds.  And an AI would be able to reprogram itself, directly, to run faster - or smarter.  And then our crystal ball explodes, "life as we know it" is over, and everything we know goes out the window. "

Yudkowsky, like Ray Kurzweil and other advocates of what is known as the Singularity, extrapolates progress in computing sciences from past performance increases in computer efficiency, networking and falling prices for computation and memory. At some point in our lifetimes, computing will be so fast, so efficient, so networked and have access to so much memory, that we will be able to create artificial intelligence (AI). We will then be able to assign this AI the task of creating an even better computing machine, as well as assisting human intelligence in solving real world problems. Because of the increased capabilities of AI (which include indefatiguability and no need for coffe breaks as well as very high intelligence and concentration), progress will be so rapid that life as we know it will change dramatically and in a very short timeframe.

I really want this to be true. But I am afraid that a) we are exaggerating our progress on many fronts and b) even if we can build this better box, we will need to improve our knowledge exponentially before the box is built so we know what to put into it. And I see no evidence as yet that we'll be able to fill the box.

We'll be talking a lot about this--and I hope to involve some of the protagonists in the discussion. But if this isn't your cup of tea, you can adjust your visits to this site accordingly.

Copyright 2007

  • Thomas Fuller

My Target List

Blog powered by TypePad
Member since 11/2006