An unemployed gentleman scholar

A comment to a previous post asks me for some personal information: “I’ve noticed that you never list a university, firm, or non-profit affiliation on your papers or website. Would you mind writing a post about how you got to be where you are, who supports your work, the reactions of reviewers to papers from outside the university/well-known industrial research lab circle, and the like? I for one would be terribly interested in your personal ascent to greatness outside the establishment.” Well, I’m as susceptible to flattery as the next guy, so here goes. First off I should say that my career path was not planned or plannable, so you can’t follow in my footsteps except to go where there is no path.

From a young age (7 years old, Luna 9 moon landing pictures 1966), my vocation was to be a scientist, and I quickly settled on theoretical physics because physics is everything — any explanation of the natural world eventually bottoms out in the laws of physics (yeah, I’m a hard-core reductionist and proud of it). At high school and college, I was pretty good at math and physics, but once I started my PhD I got a shock because I discovered that many people were much smarter than me. A few months after starting my postdoc I concluded that I would always be a mediocre physicist and would never make much of a contribution, so I quit. That was very difficult because physics had been my life’s dream, but in retrospect physics was the wrong field for me; I’m more of a natural programmer than a natural mathematician. Like many failed physicists in the mid-80s, I got into the software business and ended up starting my own company in San Francisco. I sold the business to Intel in 1999, and in 2001 I was burned out and quit with no idea what to do next.

I didn’t want to start another business, which is what most entrepreneurs do, and I didn’t want to retire and play golf (or more likely tennis). I knew about the human genome project and the importance of software algorithms — it was very appealing to me as a software guy that you could do important things just with strings of letters without knowing all that tedious biochemistry. Biology=strcmp(). So while it didn’t occur to me that I might actively work on that stuff, I thought it would be fun to learn something about it, and crashed a seminar at UC Berkeley and met a newly-hired professor named Kimmen Sjolander. She was looking for students to help her code up some algorithms for summer research experience, and I volunteered. The result was a multiple alignment program called SATCHMO, which worked pretty well but was no better than CLUSTALW according to BALIBASE, which was the only available benchmark at that time (2003).

Being a competitive guy, I was dissatisfied with that, and started a systematic project to figure out what did and didn’t work in multiple alignment algorithms, because it seemed to me that the programs had many ideas in them, but it wasn’t clear which of the ideas helped or hurt the results. At that time, Gotoh’s PRRx had the best accuracy (except maybe for T-Coffee; I don’t remember) but the code was unfriendly and slow and hardly anybody used it. So I started from the PRRx algorithm and systematically varied the elements of the algorithm with all the alternatives I could find in the literature or dream up myself. That was the line of work that led to MUSCLE. (As an aside, the MAFFT people followed a similar strategy independently, and it’s remarkable how much the original MAFFT and MUSCLE v1 resembled each other. MAFFT got published first and was better than my first attempt; I had to ‘borrow’ a couple of their ideas in order to do better than MAFFT. For example, we both had k-mer counting to build the first tree, but I was using 3-mers in the usual alphabet and they used 6-mers in a compressed alphabet which gave slightly better results. A nice idea on their part, but in retrospect I shouldn’t have copied it because it is a BALIBASE artifact so a bad example of fishing for significance).

So to answer the question: after selling my business I had some financial independence, and I’ve supported myself and my research from my savings for the past decade. You can think of me as unemployed, independent and/or a gentleman scholar of modest means (like, say, my fellow-countryman Charles Darwin). It’s hard to know how people have perceived my unconventional status. Everyone feels misunderstood and dismissed by peers sometimes (reviewers, editors, conference organizers…), so it’s impossible to say whether I would have been better accepted if I had a conventional affiliation. MUSCLE has been helpful because many people have heard of it (almost 3,000 citations so far per Google Scholar). That gave me some street cred early on, and surely helped open other doors.

I don’t see any lessons here to help young scientists, so in an attempt to avoid becoming a bad influence I will defer to a more accomplished scientist who has much better advice. I highly recommend this lecture from Richard Hamming. (If the link breaks, google “you and your research” “richard hamming”).

7 responses to “An unemployed gentleman scholar

  1. When I read your MUSCLE paper, I had a similar question: where does this guy come from? This is not a typical computer science guy and he seems to dig very deeply into the problem. After I read your story, I have to say you are a great guy.

  2. Your background is actually fascinating and an inspiration to people who want to pursue non institutional science. Someone should write a story about you. I’m currently a phd student in search of a non-traditional outlet. I have similar flexibility in terms of funding but was unable to find other examples of independent science with the exception of Dave Shaw. Traditional academia is wrought with hidden politics, not to mention the conflicts of personality and other intricacies that are obviously not advertised very well. (RIP Jason Altom). It’s great to see people doing high calibre work without having to bend to those pressures. Keep it up!

  3. This is an inspiring story. Yes, yes– I carefully read your disclaimer to not be drawn in by a ‘bad influence’ :P. Thanks for sharing.

  4. I am going to finish my PhD. soon and I have exactly the same feeling you had: a lot of people in my field are much smarter than me… It’s a relief that I am not the only one 🙂

    I have to make a decision soon about my future, I am sure I will not follow your steps, but still this was a good reading, for me.
    Thanks!

  5. Wow, I use muscle now all the time and I had no idea you are a self-made “gentleman scholar”. A fellow countryman of Darwin? And where do you live, in the UK? I lived and worked as a postdoc in London for a year and I could easily envision what a comfortable life one can have there with your status. Assuming you can afford double-glazed windows 😉

  6. It happens all the time. You are in good company – Grisha Perelman, Sasha Grothendieck, Barbara McClintock and many others who were voluntary or involuntary self-employed. Think about Mozart, Bach or Beethoven. Poor Bach – I was about to cry reading his complains on lost bottle of cheap wine he sent to somebody as as gift. I was surprised to learn that nobody knows where Leonardo’s grave. You simply imagine that Leonardo’s job description included making a toys for entertainments and , of course, military applications. ” As more I think about human society as more I like animals ” says my boyfriend, a theoretician who succeeded in academical career. Take care.

Leave a reply to Hedi Hegyi Cancel reply