Why ‘A’ Players Should Collude

They should have been paying me more.

That was my thinking, at least. I was building predictive models for PayPal to detect fraud, and my models were effective at saving the company money. From 2002 through 2004, that work was likely saving PayPal at least ten million dollars a year.

Knowing how much I was helping the company, I figured I’d have some pretty sweet leverage if I ever wanted to try to negotiate a better pay package. My work was easily quantified, and it was making a big difference.

If my models were capturing an extra ten million bucks, surely the company could reward me with half that, or at least a quarter of it, right? Especially since PayPal was being valued relative to earnings; at a multiplier of 30x earnings, those models had created an extra $300 million in shareholder value. Even a few percent of $300 million would make my day!

(At the time, I was not the type to “lean in” and negotiate harder, so that never came to fruition — but that’s a separate story.)

The Non-Zero Baseline

Sadly, my thinking was flawed. My assumption was that the baseline — the business equivalent to the “replacement” player in baseball was a business that was exactly break-even. I could add $10 million in profits, four others could do the same, and the company could effectively divvy up those $50 million in profits among the five of us.

The problem is that the baseline for PayPal — what would have happened with average players at every position — wasn’t zero. The baseline was a business that was losing hundreds of millions of dollars a year.

So I was saving the company $10 million per year, so were perhaps nineteen of my colleagues. My statistical models were ‘A’ work and a huge improvement over the baseline, but so were our fraud policies, our viral growth channels, our eBay integration, our legal maneuverings, and many other areas.

Without all of those accomplishments, PayPal would have gone on losing hundreds of millions of dollars until we went out of business. If the baseline of mediocrity was a business that lost $180 million per year, adding twenty strong people (or teams) whose “above replacement” value was each $10 million per year could collectively improve our bottom line by $200 million — but that would still only lead to a business that made $20 million in profits.

In that sort of business — which is roughly what the pre-acquisition PayPal of 2002 looked like — my original reasoning made no sense.

Collusion Is Good

Recently, I’ve been evaluating a new startup idea. It’s something that’s as ambitious as PayPal, but also just as fraught with challenges: there are a lot of ways that it could lose money. And that’s encouraged me to revisit some of my PayPal memories.

One of my biggest lessons is the importance of team quality for such a difficult and ambitious company. Looking back, it’s as if a bunch of smart and hard-working people colluded and decided to work together on a business that would have failed if we hadn’t all worked on it. Without that collusion, PayPal wouldn’t even have a Wikipedia page, let alone a huge business or a mafia.

2002 was the ideal time for that sort of collusion, because there weren’t a whole lot of options in Silicon Valley. No one was starting their own company, and the list of hot private companies I was aware of had exactly two entries: PayPal and Google.

To solve the toughest problems — and building a slightly more elegant social networks doesn’t qualify — one still needs that type of collusion from ‘A’ players. The good news for founders is that Silicon Valley is attracting more talent than it was in 2002; the bad news is that starting a company has become cool again (I’m guilty too!) and the hot company list has grown from two to dozens.

Collusion generally has a negative connotation, but in this context it can be a very good thing. If, rather than spread themselves among ten mediocre companies, ten all-stars can be like LeBron (and Dwyane Wade and Chris Bosh) and collude, they can see better results and solve bigger problems. And unlike LeBron, they don’t have to do it in zero-sum games.

Happy 10th Birthday, LinkedIn!

Ten years ago today, LinkedIn was born. It’s radically changed my life over the past decade.

I had no knowledge of LinkedIn’s existence on the day it launched in 2003. But a few days later, I got an invitation from my former PayPal colleague Keith Rabois to join this new site, LinkedIn. I signed up (as user number 1400 or so), saw a few familiar names on it, and was immediately intrigued.

In the months to come, I used LinkedIn a handful of times, connecting to colleagues and meeting a couple of entrepreneurs who reached out to me for advice on fraud prevention.

In late 2003 or early 2004, my former PayPal colleague Lee Hower reached out to me to see if I “knew anyone” who might be interested in working on data-type problems for this company LinkedIn. Lee and I had lunch, and I learned that LinkedIn had built its network out to be a couple of hundred thousand users. This was pretty cool, and something I could certainly see working on.

A week or so later, I was sitting in a Mountain View conference room with Reid Hoffman — with whom I’d spoken exactly once when he was a PayPal exec — and Jean-Luc Vaillant, brainstorming cool stuff we could do with data.

Not long after that, in February 2004, Jean-Luc set me up with a Mac laptop, and I started delving into the data. For the next few months, I was essentially moonlighting at LinkedIn, coming into the office one or two days a week, while mostly still at PayPal.

When I found that I was consistently more excited to get out of bed on the LinkedIn days than on the PayPal days, I decided I would (after biking around France for a month!) join LinkedIn full-time.

Thus began my LinkedIn employment odyssey. I worked full-time at LinkedIn from October 2004 until January 2007, leading a small analytics team with two awesome hires, Jonathan Goldman and Shirley Xu.

I learned a ton at LinkedIn, worked on some interesting and important products, and got to collaborate with lots of great people I now consider friends (and of course, LinkedIn connections).

After I left, my team became the large, influential and highly-regarded (kudos to Jonathan and DJ Patil) Data Science team.

Though I was no longer employed at LinkedIn from 2007 on, the company has continued to play a crucial role in my life. The first two engineers we hired at Circle of Moms, Brian Leung and Louise Magno, came in through the same LinkedIn job post in 2008.

Looking back at the Inmails I’ve sent, I see a number of people I reached out to try to hire and now know well; in many cases we didn’t wind up working together, but they became friends and valuable connections.

LinkedIn has prepared me for meetings with hundreds and hundreds of people. I wish I still had database access so I could run the query to figure out just how many.

When I first started poking around LinkedIn in 2003, I had a couple dozen connections. I looked at the profiles of people like Lee and Reid, seeing well over 100 connections. I figured I was simply not the kind of person who’d ever amass that number of professional contacts.

Today, I have over 900 connections on LinkedIn; the vast majority of those are people I’d feel comfortable reaching out to for an important professional purpose. Part of that increase is a reflection of my evolution, but a lot of it is thanks to LinkedIn.

To Reid, Jean-Luc, Lee, Allen, Chris, Sarah, Matt, and the millions of others who have helped to build LinkedIn, thanks and happy birthday!

Defending the Brash Arrogance of Silicon Valley

The best thing about being a statistician is that you get to play in everyone’s backyard.

I read this quote in a New York Times obituary in 2000, and it’s stuck with me ever since. As a data guy (I’m too much of a hack to be called a statistician), I love the idea of playing in lots of backyards.

If statisticians are playing in everyone’s backyard, the best Silicon Valley entrepreneurs are knocking down all of the houses and businesses in the neighborhood and putting in place something completely different.

Oh, and by the way, this is their first construction project, and it’s all going to be finished next month.

It’s pretty arrogant to attempt that sort of thing, isn’t it? It’s arrogant even if you’re not being forced to reinvent your neighborhood — or get the latest smartphone and move half of your communications to Facebook/Twitter/LinkedIn.

Early on at PayPal, the founders brashly spoke of reinventing the way people paid one another, even describing PayPal as a new world currency. Who were these founders who wanted to reinvent payments?

Peter Thiel was a thirty-something former lawyer and hedge fund manager who had little experience with either payments or tech companies. Max Levchin was a recent college grad with coding skills and no special knowledge of payments.

So did they hire other “experts” to do all of the detailed work? Not really. To solve the fraud problems that were draining the company, they hired people like me: a recent Stanford grad who’d never thought about, let alone worked on, understanding fraud patterns.

They did the same thing in other areas central to the company’s success. Peter brashly spoke of PayPal’s lack of “adult supervision”; clearly he reveled in being the cocky first-timer, destroying the experts at their own game.

Today, there are at least two Silicon Valleys. One is the Silicon Valley of yesteryear, building faster and smaller processors, bigger and clearer screens, and lighter and longer lasting batteries. The other is the Silicon Valley I got to know at PayPal. That Silicon Valley arrogantly tries to reinvent industries with a mix of deep technology, persuasive marketing, appealing products, and data-driven insights.

In the past few months, I’ve listened to entrepreneurs’ pitches for shaking up a vast array of industries, everything from diabetes care to home-buying, from photography to car insurance, from restaurant payments to government budgeting. Most of these entrepreneurs — like Elon Musk with SpaceX and Tesla, and Max and Peter fifteen years ago — have little or no experience in the industries they’re trying to upend.

In most of the world, people wouldn’t have the guts to do that. But Silicon Valley encourages a special type of arrogance, a type that claims a few smart “kids” can solve problems that have been vexing experts for generations.

Inevitably, that arrogance can be off-putting: is the slightly awkward 22-year-old computer science major who was spending half his free time at frat parties two months ago really the right person to reinvent health care?

Most likely, he isn’t the right person. But like science — a set of theories which are sometimes individually wrong but collectively get closer and closer to truth over time — Silicon Valley is a system.

By going after the big stuff — sometimes unjustifiably or arrogantly — the system collectively increases the probability of big breakthroughs. For that system to work well, you need a culture that encourages smart people who don’t know everything to brashly assert that they can do better.

I’ve seen that pattern again and again: like many in Silicon Valley, I’ve worked in a range of areas (payments, social networking, parenting) where most of my colleagues and I started with no expertise whatsoever. And the results — both for me and for many others — have been astounding, reinventing industry after industry.

Every now and then, someone like Chamath Palihapitiya bemoans the lack of big innovation in today’s startups.

I suspect that this claim is quantitatively wrong: though there are many frivolous me-too startups, there are probably more ambitious (arrogant) work-on-a-big-problem startups than ever.

Nevertheless, I completely agree with Chamath on where we need to go. Silicon Valley at its best is both arrogant and thoughtful: brashly trying to conquer problems others couldn’t solve, while thinking seriously about the societal ramifications.

Let’s go knock down some neighborhoods and build them up to be a whole lot better. Metaphorically… right?

In Search of a Happy Medium Between Academic Papers and Shlocky Business Insider Top 10 Lists

Here’s the state of the art in home page design for an academic website:

And here’s the state of the art in home page design for a rapidly growing media site:

Suffice it to say that I don’t want to emulate either the blandness of the first interface or the tabloid feel of the second one.

Now that Numerate Choir is a year old, I’ve been thinking about these extremes of online content, and trying to figure out where sites like Numerate Choir fit.

Virality

In the world of virality, headline usually matters more than content: the headline is what people click on, and it strongly influences what people share.

In the world of virality, short, simple, and universally entertaining is key. How something is presented matters more than what is presented. That top 10 list can be utter crap, thrown together in five minutes, but hey, look at the pretty slides! And my friend is #8 on the list!

The world of virality is superficial, but it has one very valuable characteristic. It understands what normal people value, and it concerns itself with what they will respond to in the real world. It doesn’t concern itself with “could” or “should”; the key term is “actually does”.

Art and Academia

In the world of art and academia, depth and truth should matter above all else. Headline is superficial: what matters is who reads something, not how many people read it.

In the world of art and academia, entertainment is looked down upon. It’s far better to be profound — and full of jargon — than amusing or captivating.

The world of art and academia is also superficial, but in a different way: superficiality is manifested as the desire to prove one’s sophistication. Those in the academic and artistic sphere generally don’t understand how to create content for the masses, but they are far more likely to discover things that matter.

Numerate Choir and a Little of Each

As I wrote about in my post on the state of journalism, there’s a growing dichotomy between 1) pseudo-news that can be both popular and profitable, and 2) deeper content that often fits better into a non-profit model.

I have a love/hate relationship with both of these extremes. And my work on Numerate Choir has oscillated between the two.

Like an academic, I write blog posts that I — and anyone with editing experience — knows are way too long even for a geeky Silicon Valley audience. And then I float to the other side: how can I craft a headline that will maximize sharing on Twitter? And why can’t I easily a/b test the stupid thing?

Like most academics, I do a poor job marketing and selling my work. I naively hit the publish button and hope for the best: beyond a quick post on the major social networks, I do nothing to publicize my writings. But then like the growth-loving impatient exec I criticize, I keep a close eye on Twitter to see if any of the cool kids have shared my new post.

Like an academic, I hope that what I write will help people understand an issue and maybe think a little more deeply. Like a growth hacker, I do care about how many people read them (even if I recognize that they aren’t for everyone).

It’s amazing yet discouraging that my most read post — by a factor of two — was a relatively lightweight rant on not needing a real-time dashboard. That post was written in response to a friend’s emailed question; I wrote and published the whole thing in under two hours.

(Credit where it’s due: that distribution was helped in large part by Andrew Chen cross-publishing it on his blog — thanks, Andrew.)

I often write with a lofty goal: capture some of the truth and quality of academia done right, and reach a larger audience. Sometimes that’s worked: The Visionary and the Pivoter and A Founder’s Constant State of Rejection (also on founderdating.com) were both read by lots of people. Two other posts I was especially proud of, Why Big Data Trumps Small Data and In a Data-Driven World, Honesty is the Fundamental Virtue, were far less successful metrics-wise.

My blog posts have generally followed one of two patterns:

  • I post something, then link to it on Facebook, Twitter, LinkedIn, etc. A few of my friends read it, and one or two of them might retweet the link. The post is seen by at most a couple thousand people.
  • I post something and link to it and a few people click through. That’s the “soft launch.” One or two of the clickers happens to be a friend who’s an order of magnitude better known than I am. When he tweets it out himself, we have a “real launch” (thanks Naval/Keith/Andrew/Eric/Jeremy/Dave). That tech celebrity share serves not just as a driver of traffic but as social proof to others: this is not a sleazy Business Insider post.

Hence Twitter is essentially an oligarchy: a handful of people have most of the power. While many could exercise it better, it winds up being an acceptable model to propagate this sort of semi-intellectual, semi-popular content.

Metrics for Success

What metrics define success for this blog?

The business model for most online media sites is pretty simple: visitors and page views translate directly into ad dollars. In most cases, the revenue per user is constant: it doesn’t matter if the viewer is President Obama or my one year old daughter banging on the keyboard.

For this blog, revenue is and will be zero, regardless of how many readers there are. I’m not sure what the most important metric is; I can think of at least five that matter to me.

Several times, I’ve written about topics I’ll almost surely never touch professionally. When I do this, it’s often with the hope that somehow it will reach someone far more influential than I, and affect a positive change on the world.

Other times, I’ll write to crystallize my own thoughts. If I get great feedback, that’s nice, but I’m writing more for myself.

Sometimes, my posts can be a good way to tell a bunch of my friends about my experiences; they then inspire deeper conversations about interesting topics.

A couple of times, I’ve been able to link people I know to a post I wrote explaining my thoughts on a particular issue. This means I don’t need to write it up again and again. As any engineer knows, re-usability is good.

Finally, I hope to share my personal learnings with others. I can think of only two places where a 35-year-old can be considered wise: a society where the life expectancy is 40, and Silicon Valley. I’ll drift off that sentiment if I can.

In a Data-Driven World, Honesty is the Fundamental Virtue

In times of war, there are no greater virtues than loyalty and bravery. A country with disloyal citizens is likely to lose every battle. Bravery is an essential trait to overcome the harshness of war.

Likewise, for most of human history, sexual purity was promoted as an essential virtue. Because contraceptives were not an option, a promiscuous society would be one with frequent unwanted pregnancies. As a result, many societies developed strong cultural norms to discourage physical relationships before marriage.

Today, the world is relatively free of wars and effective contraceptives are widely available, so these traits are valued less. Our culture constantly re-evaluates its norms.

Meanwhile, as more and more of our lives are recorded, the data we collect facilitate better decision-making. I’ve written about many aspects of that: data help journalists find better stories, help predict the future, and much more.

However, a data-driven society is only functional when people follow the right cultural norms. In a country at war, a culture of loyalty helps ensure that everyone is in line. In a capitalist society, a culture that discourages theft can allow small businesses to prosper without fear of losing property. In a data-driven society, we must stay intellectually honest.

Without intellectual honesty, the data are flawed and unreliable. Flawed data lead to poor decision-making; it’s usually better to use only your gut than to rely on a poorly formed data set. And unfortunately, many people are using data in intellectually dishonest ways.

Schools and Cheating

In a data-driven world, we must not cheat.

One of this year’s Goldsmith Award finalists is an astonishing data-driven series which uncovered high levels of school cheating. I’m proud that we honored that story, but I’ve been somewhat taken aback by some of the responses I’ve heard to it.

Many people I’ve spoken with, when told of this investigation, immediately blamed the reward structure. No Child Left Behind, they say, created an overly pressurized education system. The sort of large scale cheating exposed by the Atlanta Journal-Constitution was inevitable given the high stakes of the tests.

That is nonsense. One wouldn’t excuse a CEO stealing money from others because there was so much pressure on him to improve his company’s performance — even if the CEO thought the means of evaluating him were unfair. No Child Left Behind and other education policies aren’t perfect (my suggestions), but they’re a starting point.

To improve, we’ll need to refine our testing system and get better at measuring progress. We’ll also need a culture of integrity from teachers and administrators. Without that integrity, our system will consist of results we can’t trust — and a terrible example for students.

Guns and Intellectual Curiosity

In a data-driven world, we must approach major issues with an open-minded, intellectually curious approach.

Of course people use data dishonestly for political arguments. But it’s not just sleazy politicians: I see intelligent friends on both sides completely misrepresenting the data on gun violence in the US. Anti-gun advocates point out that the U.S. has more gun ownership and more gun deaths than other Western countries and jump to the “obvious” conclusion that more guns means more violence. Pro-gun advocates point out that the U.S. has far higher gun ownership rates than many (non-Western) countries that are much more violent than they U.S.; they jump to the also “obvious” conclusion that criminals will find a way to purchase guns regardless of gun policy, meaning decreases in gun ownership would have no impact on gun violence.

It’s likely that each side is at least partially correct. Millions more guns in Americans’ hands mean at least a few more deaths; many of the most violent criminals will find a way to kill regardless of gun laws.

Yet my friends who post these stats do so with a lack of intellectual curiosity. In most cases, they haven’t looked at the numbers with an open mind, and they don’t really understand how gun dynamics work. I’ve never seen, for instance, someone point out that gun ownership in states is highly correlated with suicide rates but minimally correlated with murder rates. That fact — which implies that less restrictive gun laws may lead to suicides but not terrible crimes — doesn’t fit neatly into anyone’s pro- or anti-gun view of the world.

I’m realistic: I don’t expect that everyone is going to gather data sets on gun violence on their own. However, because the data are out there, I ask smart people to raise their bar: if you haven’t looked at the data closely enough to have an informed, nuanced opinion, please keep quiet. Either do some real research, or don’t spread your uninformed perspective.

Pitching Investors and Misleading

In a data-driven world, we must not mislead or be misled.

Working with many early stage companies, I see a lot of investor pitches.

Startups have gotten better at crafting an appealing pitch, by throwing out numbers like these:

  • a) We’ve increased revenue by 25%, month over month
  • b) Our user base is growing: we had 500,000 users last July and now have 800,000 (often with an attached graph showing total users at the end of each month)
  • c) Our monthly retention rate is 75%

Most of the time, these stats aim to mislead.

(a) doesn’t have a baseline or a time horizon. It might mean that revenue was $12 last month and is $15 this month. Does that sound as impressive?

(b) looks at cumulative signups rather than monthly signups. Cumulative signups are always going up, and it takes a lot more effort for the viewer to see whether the second derivative (signups this month versus last month) was positive or negative. We did this in investor presentations for Circle of Moms, because we knew it would obfuscate some of our negative trends. I advocated that approach, and I don’t feel great about it.

(c) may be a useful stat, but it’s almost always calculated in an obfuscated, company-friendly way.

These kind of pitches are “just the way it is” — as is the case for misleading political data analysis. But in a data-driven world, we need to aim higher.

Entrepreneurs should assume their audience is intelligent and mature, cognizant that not all numbers go up.

And investors should understand what they’re looking at, and should call BS on entrepreneurs who surface their numbers like this.

Conclusion

Access to data is, on the whole, a very good thing. Deep, data-driven knowledge allows us to make better decisions and preserve resources. With the right data, we can better reward the best teachers, fund the top companies, and create better public policy.

However, for that to work, our society needs to create a stronger culture of honesty around data. We can’t cheat to get around failures. We must seek out all of the facts, and not promote only those that fit into a narrow ideology. And we must use data to inform rather than mislead. If we don’t do those things, we’ll make decisions that are driven by flawed data — lies — and many will suffer.

The good news is that we’re still early in an age of data-driven decision-making. Our collective culture has developed to better discourage practices like stealing and killing. In this wonderful age of data and better decision making, can we become more honest?

Journalism, Through the Eyes of a Data-Focused Entrepreneur

Fueled by wine and delicious food, the table was full of energy. The journalists and politicians at my table were eager to outdo one another. They exchanged candid personal stories about famous TV newsmen and potential presidential candidates. They recounted tales of the shocking political corruption they’d uncovered. They told their colleagues what had really gone on at that recent big event. Unable to compete with their stories, I nodded politely.

Somehow, I’d found myself at the upscale Rialto restaurant in Harvard Square, feeling like a fly who’d landed on the wrong wall. I was twenty-four, a quiet and unassuming fraud R&D scientist at a small Silicon Valley startup called PayPal. The eight or nine people at my table were quite unlike me: all big names from journalism and public policy, all far more extroverted than I, all at least twice my age.

It was January 2002, and I was sure that two worlds couldn’t be any more different. Here in the journalism sphere were politicians and journalists, extroverted and boisterous, who told great stories; back at PayPal, analytical, introverted nerds whose skills were mostly technical. At the dinner table in Cambridge were those who could talk to sources and get the scoop; back in Silicon Valley, engineers who automated processes and crunched data.

Within about a decade, all of that would completely flip. News organizations would embrace the data-heavy, analytical approach more common to tech companies. Many of 2013′s top stories would use data at a level that was unfathomable in 2002.

The Goldsmith Awards

My first journey to Cambridge was set in motion only a few days before that dinner, when I got an urgent call from my grandfather.

Can you be in Boston on Saturday to help select the winners of the Goldsmith Awards?

I had only a vague sense of what the Goldsmith Awards were; why exactly should I fly across the country on a few days’ notice?

The Goldsmith Awards, he explained to me, were something he’d worked with the Shorenstein Center at Harvard to set up, using assets from the estate of the late Berda Goldsmith (his legal client). The awards honored great journalism, but their true goal was to foster better public policy. He wanted to reward journalism that shines a light on government, highlighting bad regulations and bad policymakers for the benefit of ordinary citizens.

Being a lawyer who was both thoughtful and crafty, Bob put in place a contract that would maintain close ties between the foundation and the Shorenstein Center. A key clause in the contract stated that one of the award’s judges must be a foundation representative.

Bob had called me because he wanted me to take over as the foundation representative. He hoped that I could go to Cambridge on Friday to see how everything worked. Then the following year, I could represent the Greenfield Foundation’s on the Goldsmith selection panel.

Of course, I said. I’d long been interested in public policy — I minored in Political Science at Stanford — and this would be a real honor.

2002: Reporting

At the judging session — before the dinner at Rialto — I got my first jolt of culture shock. The Goldsmith panel of judges evaluated dozens of newspaper submissions, and their criteria were often a surprise. I’d long been a consumer of the news; that day I got to see a news professional’s perspective for the first time.

Some stories were very impressive on the surface: engaging, in-depth, surprising reports on a policy topic I knew nothing about. It turned out, however, that they closely resembled another story told earlier by someone else. The first story a journalist told about toxins in the local drinking water was probably very impressive; the twelfth such story — reported using the same template as the first one — doesn’t deserve an award.

More stories were dinged for reasons I wouldn’t have fathomed as a mere news consumer. Some were largely the product of a single leak: they came from an insider who wanted his story told, rather than from sleuthing by the reporter. Others were impressive investigative feats, but pointed to flaws in public policies which had virtually no chance of being changed.

The best pieces that year were original, impressive in the depth of their investigations, and had substantial impact on policy. The winner, about hospital care at the Hutch in Seattle, stood out for the sheer amount of manual work it required: the reporters had to wade through “100 interviews and 10,000 pages of documents” to tell their story. The story was amazing, and was notable to me for the set of skills used by the reporters: their methods were a far cry from the algorithm coding I was doing at PayPal.

2013: Reporting and Data

Serving on the Goldsmith panel soon became a tradition for me. Last week, for the twelfth time, I found my heavy winter jacket in the back of the closet (it’s useless in the Bay Area) and packed up for a January weekend in Boston. I’m now the veteran; this year I served with several people who had never judged a competition like the Goldsmith Awards. I still have yet to judge with a panelist younger than me, but I no longer elicit the “what’s that little kid doing at the table?” stares I saw a decade ago.

But that change is predictable: I knew I wouldn’t stay twenty-four forever.

The big surprise is that investigative journalism, so different from my PayPal day job in 2002, now feels like a natural project for a Silicon Valley startup data guy like me.

Journalism has changed a lot in the past decade. In 2002, almost all investigative stories were anecdotal. A story about ineffective education started and ended with interviews of teachers, parents and students. The investigation about medical treatment told stories of the travesties patients had endured, without using terms like “probability” or “percentage”, let alone “false positive”. Occasionally, a Goldsmith submission would talk about the painstaking work that reporters had done to piece together hundreds of paper records to assemble some basic statistics.

Today, by contrast, data analysis plays a huge role in many of the top stories. Of this year’s six finalists, at least three would have been unlikely or impossible ten years ago:

  • Cheating our Children, from the Atlanta Journal-Constitution, is a story about cheating by teachers and schools on standardized tests. The team looked at thousands of districts across the country for highly suspicious anomalies, like every student in a class (supposedly) erasing an incorrect answer to question #27 and then filling in the correct answer. They found several hundred patterns of student improvements that were most likely the result of fraud.
  • State Integrity Investigation, from the Center for Public Integrity, looks at the laws of each of the fifty states and grades each on their risks for corruption. To do this, a reporter in each state perused that state’s practices and regulations — a far more manual approach than Cheating our Children — and assembled a database of information about that state. The end result is both a great way of pressuring states (Utah, don’t you want to improve your D?) and an incredible Wikipedia-like online resource for others (especially journalists) interested in tackling related topics in the future.
  • The Shame of the Boy Scouts, from the LA Times, is the sad story of thousands of incidents of child sexual abuse records in the Boy Scouts. The Times pulled together thousands of newly released Boy Scout child records, using them to tell many unbelievable and sad stories about children who were molested. But the series complemented those stories with a feature that would have been unlikely a decade ago: they posted all of the documents online, for anyone to search and see.

These new data-centered stories are distinguished by three new attributes. The first relates to how a story was uncovered: many stories today are initially found not from a tip, but from a database search. In Cheating our Children, cheating was uncovered not because of a tip from a parent or a teacher, but because of a search for suspicious trends in the data. The steps to get the story were (at a high level) similar to what I was doing at PayPal in 2002: using algorithms to identify a handful of likely fraudsters.

The second data attribute is quantifiability. Historically, journalism has not been a quantitative field, relying instead on an anecdote or two, along with an assumption that “there are many others like them”. And while quantification would be silly for many stories — either Nixon’s people broke into Watergate or they didn’t — it’s an important part of any broader societal story. In 2011, the Goldsmith winner informed the public of local hospital practices that were quantifiably worse than others out there. This year, there were some great not-quite-finalist stories that found and measured the effect of cops speeding and explained just how harmful overly prevalent pain medications can be.

Finally, many of the top stories today are complemented by a structured, searchable database. Each of the three stories above features an interactive tool allowing anyone to find the information most useful to them. On ajc.com, I can look at my local school district for evidence of cheating; on publicintegrity.org I can see how my state fares with respect to corruption risk factors; on latimes.com I can see whether there were any reports of sexual abuse at a specific Boy Scout troop.

Though the world of journalism has its challenges, these are three great developments. They widen the range of stories journalists can tell, they raise the bar on their quality, and they make them individually relevant to the reader.

The Great Bifurcation

The landscape of news and other content is bifurcating, with increasing separation between work that aims to be high-traffic and work that aims to be high-impact. On one side is entertaining content, aimed at driving page views. That content may be news, opinion, or something else, but its goal is very simple: to be part of a traffic machine that underlies an ad-supported online operation.

Traffic machine content is most successful if it arouses curiosity (yes, I do want to check out the six ways that olive oil can help me lose weight!) and can be even more so if it’s also something the reader identifies with and wants to share (this is why people who voted for my political candidate are smart!). That high-traffic story, while cheap to produce, is usually not especially deep or insightful, and it may not even be true. Thus it has little or no positive impact on our institutions.

The other side of the coin, high-impact journalism, is a very different animal. It takes a lot of work and money to produce, but often doesn’t generate a lot of traffic. It may have a great impact on society, but it’s tough to justify for a business. And that’s why, increasingly, it’s the domain of non-profit entities and organizations that are only nominally for profit.

Journalism via a non-profit can be a good thing: those organizations — who today operate at both local and national scale — can focus on the highest impact work rather than try to mix unpopular high-impact stories with popular low-impact ones.

2023: Reporting, Data, and Software

In the new non-profit news organization, there is a simple question to ask: “how can we do work that will have the largest positive impact on public policy?” That is essentially the same question my grandfather asked when he set up the Goldsmith Awards over two decades ago.

To understand how that question will be answered a decade from now, one must first understand the roles played by three different people in today’s professional world:

  • The investigative reporter skillfully combs through documents and asks the right people the right questions to find information. He then turns that information into a compelling story for his audience.
  • The data scientist takes the data available to her and mines it to quantifiably understand a subject. With words, numbers, and data visualization, she shares — usually with less verbal skill than the journalist — that understanding with others.
  • The software developer takes a process that works manually, and figures out how to first generalize it and then automate it. For instance, if you have a meeting in your calendar with an accompanying address, how can software automatically send you directions at the appropriate time? A human can do it manually; the developer writes the software that will automate the process many times over.

A decade ago, the stories I read for the Goldsmith Awards were solely the work of reporters from the first group. They were executed by skilled journalists who knew how to comb through documents, convince insiders to give them secret information, and write stories elegantly.

Today, the data scientist is a key part of journalism: data skills are nearly as important for producing Goldsmith-caliber work as classic investigative skills. Data skills help both at the early phases of a story in finding anomalies worth writing about, and in moving beyond anecdotes to show that trends can be quantified. That anomaly-finding helps increase the range of stories that can be told; quantification makes the stories better.

Still, today’s journalism has a one-off quality that would frustrate a typical software developer. Sure, I can read a story about cheating in schools — or even look at how it affects my hometown — but will the story be automatically updated in three years so it’s still relevant?

In the next decade, it’s likely that we’ll see investigative reporting evolve and improve in several ways:

  • More and more journalism will be automated and updated regularly. District scores will be mined every week; state corruption will be automatically assessed monthly. In some cases, there will be written stories that complement the new data; in other cases the automated jobs will simply feed into an interactive database available to readers.
  • Investigative reporters will get better at soliciting information from their readers and viewers. It’s become a lot easier for readers to contact reporters with tips than it was a few decades ago, but there’s still a lot of room for improvement. Facebook, LinkedIn, Quora, and Twitter make it easier to find and contact the person likely to know a specific piece of information, but they’re not ideal. One could, for instance, imagine a world where citizens could record any suspicious or unacceptable government actions in a form that could be searched by reporters in the future; this would markedly improve many investigative stories.
  • The number of journalists with data skills is increasing rapidly, and that isn’t going to change any time soon: my Twitter feed is filled with data+government+journalism enthusiasts from many different backgrounds. They’re offering online courses, pushing for open data, and a lot more.
  • More and more data — particularly from governments — will come online. The picture today is awful: most government documents are still posted in unstructured form as PDFs and Word docs, making data analysis a lot tougher. That will change.

These changes will allow journalists to more quickly find important stories and tell them more accurately. At a time when some news organizations are slashing budgets and others are defining themselves, that’s important.

Merging Worlds

When I went to Harvard eleven years ago, I couldn’t help feeling like I didn’t quite belong. It was an honor to be part of the Goldsmith Awards, but I was there because I happened to be the grandson of the awards’ founder.

This year, I flew to Boston a day early and spent time with Alex Jones, the longtime Director of the Shorenstein Center. As always, I learned a few things. Alex told me about Journalist’s Resource, a great online tool which lets journalists freely access research on complex topics. He highlighted the increasing role of data in journalism and among many of the top Goldsmith Awards contenders.

While there, I also chatted with Nicco Mele and John Wihbey, both staff members at Shorenstein. Nicco lectures on technology and simultaneously runs a web consultancy; John is the developer behind Journalists’ Resource. Both were full of ideas on how data, journalism, and technology can come together to improve public policy, telling me about cool projects like Journalist’s Resource and Nearby FYI. It was inspiring, and in my conversations I saw a new take on my grandfather’s vision for journalistic impact.

That Saturday night, after a full day selecting the Goldsmith finalists, seven of us met for dinner at Rialto. I was still the introverted techie, and I still didn’t come armed with personal stories about Clintons or Bushes. But having just discussed such a strong set of data-heavy stories, I knew something was different. The landscape has shifted, and journalists have caught on to many of the skills my friends and I value in Silicon Valley. Once just a fly on the wall, the data geek is now an important part of the story.

Thanks to Ben Greenfield for his great feedback on this post.

Six Steps to Growth: What I Learned as 500 Startups’ Growth Hacker In Residence

I just finished up a six month stint as Growth Hacker In Residence at 500 Startups. I want to share a few of my learnings.

But before I do that, a couple of things to get out of the way.

First of all, 500 Startups is a strong operation and an amazing story. If you’d just arrived in Silicon Valley yesterday, you’d probably have no idea that it’s only a couple of years old. In public, Dave McClure is a little zany/off the wall/all over the place, but that’s far from the entire story: he understands the venture world, he has good ideas on how to shake things up, and he’s built up a good team that complements his skills.

500 Startups has two major unfair advantages: a huge network of talented mentors helping portfolio companies (with many others clamoring to mentor) and impressive ties abroad — incredibly valuable for deal flow, fundraising, and partnerships. 500 acts like a growing startup rather than a greedy venture firm: with a small fund, Dave opts to hire more employees who will help startups, rather than keep extra money for him and the other partners. Venture capital is going to get shaken up over the next decade or two, and 500 Startups’ strong network and ungreedy, open-minded approach will put them in good position to thrive.

Second, any thoughtful person who understands how to grow an Internet business cringes at least a little bit at the term “growth hacker.” The hype-to-substance ratio has become high: I suspect that a high percentage of the 338 people who show up in a LinkedIn search for “growth hacker” don’t have any idea what they’re doing.

So it was with a tiny bit of reluctance that I accepted the “Growth Hacker In Residence” title. For better or worse, 500 is the ultimate anti-corporate firm, so I don’t have any business cards to implicate me. But it all worked out well.

Why Growth is Important

I didn’t push back on the title because there is in fact a lot of substance behind the term growth hacking, and hype isn’t always a bad thing. The underlying skill it describes — increasing a product’s usage and distribution — is incredibly valuable in consumer technology companies. And contrary to the complaints of some, growth is not just product marketing by another name: skills in product design, engineering, and data analysis are just as relevant.

To see why growth is so important, one need only look at the big successes of my former colleagues at PayPal (aka the “PayPal Mafia”): YouTube, Yelp, Yammer, and LinkedIn. All succeeded in large part because they were designed for growth. The ability to embed videos on MySpace separated YouTube from the pack; Yelp mastered SEO for traffic and retained a small core of “elite” reviewers to build content; Yammer made it exceptionally easy to invite colleagues; LinkedIn made connecting with professional contacts seem like an obvious and important need.

I don’t think this is a coincidence. PayPal had to scrap and claw to grow and survive; its alumni figured out how to grow their next companies the same way. By contrast, early Google was a company with more impressive technology that didn’t have to scramble to grow. Its employees likely learned less about growth while at Google, and those alumni have since had far less success with distribution and startups in general.

In that respect, most entrepreneurs are more like the Google non-mafia than the PayPal Mafia: they don’t bring to their startup a strong understanding of how to grow and maintain a large base of users. That lack of understanding is true of most 500 Startups companies — as it is for the companies of every investor. Fortunately, that skill is — to an extent, at least — teachable.

Helping Companies

So how could I help the teams of 500 Startups?

For one, I could help by setting the right context: growth is at least as much discipline and execution as creativity.

It turns out that there aren’t usually easy solutions to growth challenges. This is where the hype around the term “growth hacker” can be dangerous. Hacking implies something weird and unpredictable. It evokes a sense that the perpetrators are more crazy artists than precise scientists, that their methods are unusual and tough to replicate. And it implies that there’s some sort of obscure code that can be cracked to yield the magic growth solution.

If that were true, there might be a simple but hard-to-find switch to flip. This is almost never true. “Change the button color to green, and you’ll immediately go from 100 users to a million!” is probably not advice to bet your business on.

The Steps

A consumer startup with good product can become proficient at growth via a fairly straightforward series of steps. As a founder, I informally understood that series; working with the companies of 500 Startups allowed me to formalize it as a six step process:

1. Track: Figure out what needs to be tracked. Track it.

2. Understand: Delve into the data to understand how people are using the product.

3. Prioritize: Evaluate and prioritize the areas most likely to yield growth. Sometimes they’ll be tweaks, sometimes they’ll be re-architected features, sometimes they’ll be completely new features.

4. Design/Write: In the top area or two, design a few features that are likely to yield growth. I emphasize writing because the words describing a product often matter at least as much as any other characteristics.

5. Build: Code it up, push it out.

6. Measure: Gauge success of new features. GOTO 1, 2, or 3, adjusting strategy based on the results.

The Process

Much like a signup flow that sees significant dropoff upon asking for the first piece of information, companies often get sidetracked before finishing the first step. Tracking and storing data — so you can actually go back later and see what went on — requires discipline and is low on glamour and creativity. You know that tracking won’t help you tomorrow: isn’t it more important to build out that cool feature that might actually help right away, or fix a bug that a user is complaining about?

Usually, no. Those who try to grow by relying only on designing and building often wind up spinning their wheels: they work on the wrong area of their product, again and again; they never completely understand what worked and what didn’t.

In my stint at 500, there were a few times when a company I worked with didn’t get past step one. Needless to say, that was frustrating for me, and it forced me to come up with a rule: if you want me to spend my time helping you grow, you need to set up the basics of tracking before we start working together. That rule worked well.

The technical details of what needs to be tracked vary; a more mature company will usually need to store most user actions in their own database. Pretty much anything the company cares about improving — the number of users coming in via certain channels, views of certain pages, clicks via email, invitation responses — needs to be stored.

Once the data are in place, understanding the big picture comes to the fore. How are people finding out about the product? How and how frequently are old users coming back in to the product? What, if any, are the social dynamics: are people coming back because someone commented on something or because someone followed them?

When that’s in place, it becomes relatively easy to come up with some estimates of the metrics impact of potential product improvements. To be sure, those estimates are more likely to be accurate when done by someone who’s done them before, but anyone can and should make them.

Those estimates facilitate an ordered list of possible product changes. These changes might be minor — changing “Next” to “Continue” or “Submit” — or major, like building out a completely new signup process. And again, designing or writing by someone who’s skilled and knows what’s likely to be effective, is valuable.

Building out those designs is the obvious next step; I generally didn’t get directly involved with that at 500.

Measuring the effects of those changes — for companies with data scale, via A/B testing — is the vital last step. It’s amazing to me how many companies will spend weeks working on a new feature or design, and then never really know if it was effective. If you’re tiny and just trying to get some kind of win, that’s okay. If you have an audience of hundreds of thousands of people or more, it’s negligent.

Succeeding As an Adviser

I effectively served as an adviser to the 500 Startups companies I worked with (though not financially), and I’ve come to realize that it’s not easy to add value as an adviser. Many advisers get a bunch of equity in a company, but don’t add a lot of value. If I agree to advise a company, I know there’s some chance that my advice won’t end up being relevant, but I want to maximize the odds that the time I spend with founders and employees has a measurable impact.

As a founder, I intuited those six steps, and then spent hundreds or thousands of hours building everything out with a team of people I saw every day. As an adviser, I’m spending hours or tens of hours, so my thought process has had to be tighter and more formal.

The six steps outlined above can work pretty well regardless of the scale of involvement. There are a few companies with whom I spent a few hours a week over several months. With those companies, we spent a bunch of time upfront to make sure everything was being tracked. Then every week or two, we’d run through steps 2-6: finding a problem area, brainstorming and designing possible improvements, building, and measuring the results.

With most 500 companies, my interaction was lighter: a 30-minute conversation every month or two. That may seem like too little time to go through six steps, but an abbreviated version can still work well. The conversation would usually look like this:

  • 1 and 2. Tell me about your business. How do people find out about your product, how do they get back in to the product, etc. This works better if the founders get answers in advance.
  • 3. Let’s figure out (in part by looking at the existing product) which areas to prioritize.
  • 4. Let’s look at the product and try to narrow down (almost) exactly how the changes will look.

Then the startup does the details of step 4, and all of step 5 on their own — actually building it out — and does the measurement step (6) with a little bit of guidance.

That process isn’t always going to be effective, in large part because most product changes don’t work as hoped. But by understanding the problem and betting on the changes with the highest expected returns, I’ve seen many wins. Here are a few:

  • One company I worked with, TradeBriefs, gets most of their return traffic from email newsletters. That became obvious when we walked through the basic metrics of the business to understand how everything fits together. The content of those newsletters had always been hand-curated, which means it consistently looked professional but wasn’t optimized.

    In previous lives, I’d seen how similar types of email content can have vastly different clickthrough rates. To better discover which content will yield maximum click rates, the team created a system that allows them to test different versions of similar emails each day. Their volume was large enough to allow them to randomize content for a small batch of users early in the day, wait, pick a winner, then send the winning (best) content to the majority of their users later in the day. They implemented that and saw a 30% improvement in their email click rates.

  • Another company I worked with is a community where users are creating all of the content and interacting with one another. They were seeing lower usage among people who had been on the site for more than a few months, and they wanted to address that.

    We spent a lot of time upfront on metrics, tracking activity levels and looking at how users transition across differing levels of activity. That digging led us to realize that people who became moderately active by day five were far more likely to stay active than those who had not. So we focused our efforts there, testing lots of little tweaks in the new user experience to get people to activate in their first few days.
    Cumulatively, those tests led to a 30% increase in initial activation; we expect that to pay similar dividends long-term.

  • A third company is largely a commerce company that makes money when people buy physical goods. Most of their traffic — and purchases — were coming from advertising on Facebook and Google. The founder discovered that she could get more traffic from those same channels much more cheaply if she sent users to a more fun — but less commerce-focused — area of the site. However, only 4% of that cheaper traffic converted into leads likely to monetize, making it not worthwhile.

    Digging into the flow, we found a number of places to optimize the user interface — largely by making it intuitive to click through to the next step — and increased that 4% conversion rate to 16%. That difference was a huge one, immediately making the new stream of traffic immensely profitable.

Those three examples are in the minority: for most of the companies I spent time with, the immediate value I created for them was little or nil.

Yet I come away feeling successful. In six months, I’ve been part of a few real wins and found a template for how I — and hopefully others at 500 Startups going forward — can help startups.

Thanks to Dave, Christine, the staff at 500, and the many, many founders and employees I’ve worked with over the past six months. I look forward to continuing to work with you all as a 500 Startups mentor and friend.

And now it’s time for me to go and build something…

Goodbye to a Patriarch

This morning, my grandfather passed away.

Robert K. Greenfield was ninety-seven. He lived a long and interesting life, and he had a real impact on the world in which he lived.

When I was in sixth grade, I was given an assignment to write about a modern day Renaissance Man. I wrote about Bob and his knowledge and interest in countless areas — law, tennis, wines, public policy. At the time, he was seventy-four and seemingly winding down his life. Little did I realize that he still had more to accomplish than most of us do in our lifetimes.

In 2008, Bob had a health scare, and it looked like he might not have much time left. I wrote him the letter below, to let him know what he’d meant to me and my family:

I’d like to share a few thoughts with you, and hopefully give a little something back. You have played a considerable role in shaping who I am, and you will continue to do so, no matter what happens with your current condition.

Most people give up at some point or another. They try unsuccessfully to change things, and become cynical. They get worn down by the grind of life around them, and become indifferent. They succumb to greed, and become corrupted. They lose the discipline to take care of their bodies, and become slothful. They lack the intellectual rigor to seek truth, and become mentally lazy.

You aren’t like most people.

I am in an industry and a world that’s very different from the one you inhabited when you were my age. I sit in front of a computer all day; I solve algorithmic and mathematical problems, trying to construct a product that helps to bring people together; I try to stay as far from the legal world as possible! I am very much a product of my generation and my background: an Internet geek who lives in San Francisco, has traveled around the world, has no kids at age 30, and spends at least ten times as much money on food as on gasoline.

And yet, I cannot help but look to your example as a guiding force in my own life.

Elaine and I see your 71 years of loving marriage with Mommom as amazing, and something we aspire to (if we’re lucky enough to live that long!). We were so glad you made it to our wedding, and honored by your kind words there.

(An aside: a Chinese colleague of mine mentioned that she and her husband were very impressed by what you said; she said that your brief speech made it clear that you were the patriarch and leader of the family. Apparently these things cross cultural lines.)

I am continually impressed by the strong foundation you’ve built with the Goldsmith Prize. The more I learn, the more I realize how difficult it is to set up something like that, and the Goldsmith Awards have been an unbelievable success. Your combination of creativity, foresight, diligence, and personal relationships have uniquely crafted something special.

I will work hard to build on the Goldsmith successes, but it goes beyond that. I also use the Goldsmith model every day in my life: today I try to create a project of similar magnitude in the Internet world; tomorrow I may look for the next big thing in the world of philanthropy. Thanks to you, I’ve gained great insight into the process of creating something substantial, and a big boost in the inspiration needed to actually do it.

There aren’t a whole lot of people who can make me feel guilty for not working hard enough, but my 93-year-old grandfather is one of them. Your discipline, hard work, professionalism, attention to detail — I could go on and on — are astounding. I didn’t realize that when I was a ten year-old kid, but at a certain point I realized that your successes — academic, professional, family, athletic — were not the result of happenstance.

Many successful people get ahead by cutting corners, by sacrificing their principles or the common good. But that’s also not you: your honesty, your sense of fairness, your (stubborn!) desire to stick to the rules keep you grounded, and are yet another reason you serve as an inspiration to me and the rest of our large family. The ongoing trend in your life has been to look beyond what’s easy, and instead do what’s right.

My words are a mediocre expression of my sentiments and appreciation for all you have done for me and my family, but I hope they give some small sense of the effect you’ve had on my life. That effect will last as long as I will — thank you.

Fortunately for all of us, he lived another four and a half years. This past June, with dozens of family members, we celebrated 75 years of marriage between him and my grandmother. He got to meet my daughter a few months after she was born last fall, and then again before he died this month.

Amazingly, he was an avid reader of this blog. In his typical self-deprecating fashion, he explained how difficult it was for a “twentieth century mind” like his to understand the new things I was writing about. I know that wasn’t true.

The last time I saw him, less than two weeks ago, he was physically tired but mentally sharp. He was trying to figure out how policies around the fiscal cliff might affect his family, he was doing what he could to help with intra-family disagreements, and he recounted countless stories from nearly a century on earth.

Aside from the combination of physical frailty with mental sharpness, two memories from my recent trip stand out.

One memory is my grandfather’s awe at the possibilities that exist in the modern world. I shared some of my own startup experiences, and he explained how he wished he’d been able to do something similar. Bob certainly had an entrepreneurial mindset — after he retired he set up the Goldsmith Prize — but his day job as a lawyer was much more cut and dry. I’m incredibly fortunate: I live in a time and place where I can dream big dreams and run with them. Every day, I try to bring to my projects some of my grandfather’s intelligence and tenacity.

The second memory is a more personal one: my grandparents spent much of their time on the couch next to one another, savoring their final days together. My grandmother asked my grandfather if he still remembered their first kiss; he said he did, but amazingly their stories diverged. I feel sad for my grandmother: she’s only 95, with years of sneaky tennis drop shots ahead of her. But it was amazing to witness that kind of love after more than 75 years, and I’m glad that their four kids, fourteen grandchildren, and seventeen great grandchildren saw it and benefited from it too.

Thank you, Bob.

After the Thrill Is Gone

A growing startup can be bliss.

When Circle of Friends was growing rapidly, I’d wake up suddenly at three AM, my heart jumping with a mix of excitement and nervousness. Because our technology was brittle, I’d walk out to the kitchen and look at my laptop to make sure the site hadn’t crashed. And I’d notice that, amazingly, our traffic for the period between midnight and three AM was our highest ever.

Our highest ever? Our highest ever!?!? How could I go back to sleep now? It was three AM and my day had started: I boiled some water to make a pot of tea, and I got to work.

That was the startup honeymoon phase, and it was exhilarating.

Moving

Circle of Friends (which became Circle of Moms) had its first office in our little house in downtown Palo Alto. We’d bought the house in 2006, knowing it needed work. Elaine, being an architect, saw this as a positive: it was a house we could truly make our own.

After a few months where I was both living in the Palo Alto house (with Elaine) and working in its kitchen (with Ephraim during the day — but alone at 3 AM), the center of my world moved to San Francisco. Elaine and I decided we’d rent out our house and find an apartment in the city, and Ephraim and I would get an office in SoMa.

That basic arrangement — living near Dolores Park in SF, working South of Market) — was my life for nearly four years.

When, in early 2011, we found out Elaine was pregnant, renovations on our house in Palo Alto were nearly finished. At that point, we hadn’t decided what would happen after the house was ready to live in: we were open to selling it, renting it, or living in it ourselves.

The fact that we’d soon have a little daughter made living in it — and a move to Palo Alto — more appealing. Our own new place, with a nice room for baby, lots of natural light, a beautiful kitchen (nice work, Elaine!), and an edible garden seemed like a great option.

But I also knew myself and my work patterns. It was hard to imagine having a baby, a startup, and an hour-plus commute. Dealing with two of those three would be challenging; dealing with all of them would just be stupid.

The more I thought about it, the more I realized what a move to Palo Alto would mean: the beginning of the end of my time at Circle of Moms.

Elaine and I soon decided we’d move back to the house when our daughter was born, and that meant my time at Circle of Moms would be limited. Certainly not “I’m giving two weeks notice” limited, but also not another five years.

I told Ephraim of my intentions, assuring him that my goal was to get Circle of Moms to a place where we’d both feel comfortable telling the team that my role would change. We agreed that it would not yet make sense to share those plans with the rest of the Circle of Moms team.

That marked the beginning of a very different phase from the one where I’d wake up at three AM and excitedly sit down to work.

Departures

On a Monday morning in July 2011, Elaine was now six months pregnant, and we were at the doctor’s office. She had gone to the bathroom, and my phone buzzed with an IM from a colleague.

hey mike – will you be in the office today?

“Yes”, I replied, “am at a doctors appt now, will be in around 10″. A moment later, I got my colleague’s response:

oh okay – can i grab you for a chat sometime before 11?

Before 11? Crap. She’s going to tell me she’s leaving the company.

I was right, it turned out. And though my co-founder and I did what we could that week to try to convince her to stick around, we knew our odds were slim. We were ultimately unsuccessful, and she left Circle of Moms a few weeks later.

The Responsible Founder

At the time, there was a part of me that wanted to check out and move to the next thing, but as a responsible founder, that wasn’t a viable option.

In each of my three jobs before becoming a founder, I’d decided at a certain point that I’d had enough. Within weeks of coming to that conclusion, I’d told my boss I was leaving; by a few weeks after that, I’d formally moved on.

On that July Monday in 2011, I’d just heard my employee’s plans to leave and join another company, and I knew she’d soon be out the door. Meanwhile, my writing was on the wall and I was ready for something new — but acting on it now was simply not an option. I had to pull myself together, convince the team that everything would be fine, and get back to work.

Fortunately, the process went about as smoothly as it could. The team took things well, another employee immediately stepped into the shoes of the woman who departed, and we continued to make progress as a company.

But that summer was still a challenge for me personally. Weekends, which Elaine and I spent in Palo Alto, were awesome. We were in our beautiful new home, getting everything ready for ourselves and for the baby, cooking in our spacious kitchen, and enjoying the warm sun. During the week, we’d stay in cold, foggy San Francisco, and slowly pack up everything from our small dark kitchen.

Several other people left the company over the next few months. Our traffic was growing and we were executing well on the product, but we certainly didn’t have rocketship growth and our talented employees had plenty of other options. Again, I was tempted to follow my employees and also move on to the next thing; again it was not an option.

With all of that going on, I’d become more and more open to being acquired. That was a big change from a year or two prior, when the acquisition offers that came our way held little appeal for me: earlier, I would have been disappointed if another entity had ultimate control over the product and the company. But now, I’d been through the startup building process, realized I was unlikely to be the visionary of this business, and was open to a “strategic partnership.”

I spoke with another founder who had recently been through the acquisition of a similar-sized company. At a certain point, he told me, he’d realized that his company wasn’t likely to be a huge success, and that an acquisition was the way to go. He spent an entire year getting the company to that stage, in the process building relationships with his eventual acquirer, and said it was easily the toughest year of his life.

Leaving

Our experience was much the same: it was just under a year from the time I knew I was ready to move on until February 2012, when Sugar acquired Circle of Moms. It wasn’t the most fun year, but it was a productive one in many ways.

And I was fortunate. With a combination of luck and skill, we’d already built a real business, a strong team, a solid product. We wound up making money for our investors — even if it wasn’t a billion dollar sale — but the process still took us a year. Many founders spend a year or more on the less fun parts of running a company, without seeing a positive exit.

Christine Tsai, with whom I now work at 500 Startups, wrote a great essay comparing parenthood to building a startup. I agree with everything she says: in both domains, you work hard, it’s hard to figure out who to listen to, and it’s terrible… and fantastic.

There’s one other parallel I’ll add. As both a founder and a parent, you’re responsible, and you can’t just pick up and leave when you feel like it.

A mother’s work is never done. A founder’s work most likely will eventually be done, but it certainly won’t be today at 5 PM, and he can’t know in advance exactly when it will be.

And that means a founder can’t just give his two week notice and disappear. Even after the thrill is gone.

Why You Should Play Games In Inefficient Markets

Team Rankings, a business I started in 2000, takes a quantitative approach to understanding and predicting sports events. Over the years, we’ve built products targeted at a number of different groups within the world of sports enthusiasts.

Two products we’ve built — one to help people beat the Las Vegas line, another to help them win their NCAA Tournament office pools — are both data-driven tools to predict outcomes of sporting events. However, they reflect two very different sides of the world of quantitative decision-making.

The Skilled Opponent

When I was working at PayPal in the early 2000s, I was spending most of time building predictive models to detect fraud. At home one night, I decided to see if I could apply a similar methodology to sporting events to use on Team Rankings. My system incorporated a large number of inputs — how each team had been playing, what their respective strengths were, how far each had travelled — and used them to find a blackbox (i.e., really complicated) predictive model to assess the likelihood of a bunch of events related to a game. Among these: how likely is each team to win, how likely is each team to cover the point spread, and what is the expected final score.

It turns out that this approach works well enough to consistently beat Las Vegas. For the current models — which use essentially the same methodology — here are the results for college basketball since 2008-2009:

For these two bet types — against the spread and over/under — Team Rankings’ picks have been correct 5214 times and incorrect 4452 times. The odds of this level of success by a dart-throwing monkey or TV commentator — who would expect to be correct 50% of the time — is less than one in a billion.

However, while 50-50 constitutes “break-even” from a statistical perspective, the break-even point for a Las Vegas gambler is higher. The house takes an extra cut: in most cases, a gambler bets $110 on a game to win $100. That means that you need to be correct at least 52.4% of the time just to break even.

Team Rankings’ results beat that threshold too: 5214 wins and 4452 losses constitutes a 54% winning percentage. And that means that someone who gambles strictly based on Team Rankings picks will make a profit over the long term. Bet $100 on each game, and you’ll make an average of $7,000 over the course of a season; bet $1000 per game, and your expected return will be $70,000.

That’s a terrific return, and I’m proud of it.

But still, it begs a question. Team Rankings has a great team (I’m a tiny part), some cool technology, and lots of data, and the best we can do is 54%? That’s barely better than a coin flip! It means an expected return of about 3% on each bet — a good investment, but hardly earth-shattering and a far cry you get from the “100% GUARANTEED PICKS!!!” you see on sleazy gambling picks sites.

[Disclosures: Team Rankings' picks are for entertainment purposes only. Of course. And while I'm capable of thinking like a scientist (constantly skeptical), I'm also capable of thinking like a writer (cherry picking to prove my point). These are among the better-performing of our models, but overall our results are likewise very strong and profitable.

At some point, I'll take Anthony from Kaggle up on his offer to run a contest to predict games, so people smarter than I can have a crack at it. If we did that, we might improve the number to 55-56%.]

The Unskilled Opponent

Some of Team Rankings’ recent product innovations tell a very different story.

For many years, we’ve provided odds and analyses related to the NCAA Tournament. Our products have included tools to match up two teams, along with probabilities for each team to make each round of the tournament. So, for instance, you can see the odds of Gonzaga making the Sweet 16, the Crazy Eight, the Final Four, etc.

A few years ago, I realized that those features could help people win their NCAA Tournament pools, but only if they used our numbers in the correct context. Say, for instance, that Kansas is the team most likely to win the tournament, with 20% odds, but Kentucky is just behind at 19%. Kansas would be a better pick to win than Kentucky, right?

Not necessarily. Let’s say Kansas is a really popular pick, but Kentucky is not. 50% of people are picking Kansas to win and only 10% are picking Kentucky. If you picked Kansas, you’d have a slightly better chance of being right on the winner, but because you’d be competing head-on with many more people, you’d almost certainly have a smaller chance of winning your pool. In entrepreneur-speak, picking Kansas would be like trying to build a mobile photo sharing app in 2011, a group buying site in 2010, or a Facebook game in 2009: good business or not, you’re setting yourself up for a lot of competition.

So, with some help from Brad, we built a tool that allowed us to answer the question of what picks would maximize someone’s odds of winning their pool. To do that, we combined teams’ win probabilities (which many others do) with data on how many people picked each team (which no one else does). We simulated millions of tournaments, randomizing both the outcome of the games and the picks made by people in your pool.

The end results are pretty astounding. Our top brackets — based on a moderately conservative set of assumptions — had an expected return on investment (ROI) of around 800%. This bracket, for instance, would have had about a 0.9% chance of winning a 1000-person pool, nine times higher than the average participant. By picking as our champion Ohio State — a very strong team not given enough credit by the public — our odds would be much better than if we picked Kentucky, the best team but also one strongly favored by the general public.

Our 10-person bracket looked quite different: with a smaller number of people to beat, our simulations indicated it was a stronger strategy to pick mostly favorites, with Kentucky as the eventual champion. This bracket had a less ridiculous but still quite impressive 186% expected ROI.

Kentucky wound up winning the national championship. Most of our users in small pools did quite well and won their pool; our users in large pools did not. We won’t be successful every year, but over time our results have been very strong. And this approach works: I feel comfortable with the assertion that the average yearly return for our strategy will be at least 200%.

Comparing Markets

How do we synthesize all of this and bring it back to the non-sports world?

A bet on a game at a Las Vegas sportsbook and an entry in your colleague’s NCAA tourney pool both constitute a wager on sports. However, the underlying market dynamics could not be much more different.

Las Vegas is more or less efficient: if lots of people bet on one side of a game, they’ll update the odds. In contrast, your friends and colleagues in the NCAA pool are likely making impulsive decisions that are economically irrational.

Hence there’s one world (Vegas) where 3% returns are celebrated as amazing wizardry, and another (friends’ pools) where you can have expected returns of 900% without anyone really paying attention.

If you’re looking for a financial return, though, there’s a catch. There’s only one NCAA Tournament per year, so your opportunity to make money is limited. In theory, you could enter lots of pools with distinct but complementary undervalued picks, perhaps giving yourself a 50% chance of winning with only 15-20% of the pot. But you’d be putting your marbles in one basket.

By contrast, each year there are thousands of regular games on which you can bet against Las Vegas. Adding those together can yield a solid expected ROI over the course of a year. Quantitative hedge funds generally take this against-Vegas approach: they find short-term inefficiencies, and bet on them again and again.

Though difficult, Team Rankings and hedge funds show that betting in Vegas-style almost efficient markets can be extremely profitable. From the actor’s perspective, it’s a bunch of bets in established markets with positive ROI. Yet from the world’s perspective, it’s a bunch of minor market efficiency improvements, but a world that hasn’t really improved in any meaningful way. In other words, something that’s more Wall Street than Silicon Valley.

By contrast, the pool of your buddies — while itself not a world-saving problem — represents a far larger and more profound inefficiency. The large-scale decision-making of Las Vegas and Wall Street is close to being economically efficient, but one-off decision making by individuals and businesses is not. Most choices — companies deciding whom to hire or where to put resources, government choices on how to run cities and schools, individual choices on where to invest, and which teams to pick for your NCAA pool — are made haphazardly and could be improved a lot.

It’s tougher to build a business to address these massive inefficiencies: to build something large, you need to find important, quantifiable decisions that have associated data and haven’t already been examined properly. That type of problem is more interesting, has more upside (+800% vs. +3%), and can be much more impactful than its Vegas-style alternative. And it’s why, without hesitation, I choose Silicon Valley over Wall Street.