Archive for the ‘Assessing Performance’ Category

Time for a Gold Standard of Use

Monday, February 27th, 2012

Ask most people what “the gold standard” is and they’re likely to tell you it is a metaphor for “the best.” Indeed, when a New York Times opinion piece recently took the Obama administration to task for a certain environmental policy and the author called it “the bronze standard,” it was clear it was no compliment.

In the social sector, we talk about the gold standard as a specific kind of evaluation that has been designed to determine the impact of a program or set of activities. A gold standard impact evaluation uses randomization to determine cause and effect. It is based on the science of clinical trials and it is used to attribute change to a particular intervention. For instance, rather than just observing that students improved their reading skills, an impact evaluation allows you to know what caused that improvement: if the improvement was the result of a teacher, a new curriculum or perhaps a longer school day—or whether the change would have occurred anyway because of exogenous factors.

These evaluations are enormously important because they can provide evidence about what works to improve people’s lives; and, at least as important, what does not work. Many foundations are funding impact evaluations these days to inform national and international policy, funding decisions, and more effective practice. Seems like a sound approach, doesn’t it?

Unfortunately, it does not always work the way we plan.

Influencing policy?

First, relatively few policymakers rely on high-quality randomized studies to shape their perspectives and approaches. I recently spoke to a senior executive at a preeminent research organization dedicated to generating knowledge, largely through randomized studies, and communicating the findings to influence social policy. He explained that while evidence has been gaining ground in national policy debates, there are essentially four factors at play that inform and shape national policy: 1) ideology; 2) politics; 3) evidence; and 4) political horse trading. Evidence is only one of four factors — and it ranks third in the sequence.

When studies validate ideology and political orientations on both sides of the aisle, as they did on welfare reform – where the results highlighted the importance of work as well as the need for extra supplements to low wage work – policymakers used the evidence. But when it doesn’t validate ideology and reinforce strongly held beliefs, the evidence is often ignored, even when well communicated and well substantiated. Nationally, this is improving with new Office of Management and Budget standards that promote programs with rigorous studies of effectiveness, but still we see policymakers choosing the evidence that suits their beliefs rather than the other way around.

Influencing practice?

It can be even harder to make the evidence central to decision making in practice. A seasoned evaluator who has worked with many foundations and their grantees recently complained to me that even in fields where there is strong evidence of what the best practice is, only about ten percent of nonprofit practitioners follow those practices. Perhaps the proportion is higher than this evaluator estimates, but it is surely true that many nonprofits don’t operate in ways that are consistent with what the evidence shows. Why would this be? Why not use evidence to shape practice?

There are many possible explanations, but the two I think are most important are:

  1. A misalignment or a lack of incentives. It may be expensive to revise practice to align with the evidence – including investing in developing skills, hiring new personnel, or even changing the basic activities an organization performs. There are often no explicit incentives or funding to make the necessary adjustments.
  2. Not knowing the evidence. Many – especially small – nonprofits (and their funders) do not stay current with the literature in their fields and, as a result, may not be aware of the latest studies showing what works and what does not. There has not been an easy way to stay current on the literature and reading through reams of academic papers is not at the top of most practitioners’ or funders’ lists of priorites.

A further complication concerns an evaluator’s own belief system, which influences the interpretation of results for even highly rigorous experiments. When I asked an esteemed university-based evaluator who studies social service programs, “How often are your evaluation findings used?,” he said most of the time they were used for “tweaking.” In most of his impact evaluation studies, he further explained, there is “no effect,” meaning there is no evidence that the service being provided is helping people in any significant way. He exercises great care in the way he interprets and communicates such findings so that the evaluation results do not unduly harm what he views as essential social service programs for poor people. The primary value in this calculation is that any program serving the needs of poor people is better than no program, even if the program itself is not proven effective.

Influencing philanthropy?

In philanthropy, the evaluation picture is a bit mixed.

On the one hand, a 2011 Patrizi Associates study found that in recent years, foundations have placed less emphasis on evaluation – spending less money and reportedly having diminished influence on the foundation’s decision making. On the other hand, a recent CEP study found that 90 percent of foundation CEOs report that their foundations conduct formal evaluations of their work, often using third party evaluators to do so. Still, 65 percent of these same foundation CEOs report that it is challenging to have evaluations result in meaningful insights for their foundation. Thus, even those foundations historically oriented to evaluation find that effectively using evaluation results remains elusive.

While the challenge of using evaluation findings has perhaps led some foundations to eliminate dedicated evaluation positions, it appears to me that in the ebb and flow of foundation trends, more foundations are now creating evaluation functions as part of their operations; they are increasingly likely to link those functions to strategy and organizational learning. It is essential to seize this opportunity and plan for evaluations that are designed to purposefully support learning and decision making, as well as make better use of findings, in order to drive better actual results.

Many types of evaluations matter

The issue of “gold standard” evaluations in the social sector is controversial for other reasons. Many believe that the crowning of them as the preferred method has produced a kind of gold rush towards randomized controlled trials (RCTs), which are often narrowly focused, limited to variables that are more easily measured, and not always focused on what is most important. The fact is that there are many important forms of evaluation – performance evaluations, formative evaluations, developmental evaluations, cost effectiveness studies, case studies, and more. While well-designed RCTs can offer strong evidence as to what programmatic models work on the ground, and can inform government decisions about resource allocation, they need to be blended with other types of studies to answer the how and why questions about effectiveness. RCTs also are not applicable to evaluating things like advocacy or field building – areas of growing interest and importance in the sector.

Different forms of measurement are valuable to answer different questions at different times. If we are trying to maximize the use of evaluation results, it is essential to match evaluation methods to questions that are important to answer – that if answered, might make a difference to how one thinks about a given approach or set of activities.

It is time for a gold standard of data use.¹

More important than pursuing a single standard for measurement, is the need to inculcate the expectation that all nonprofits and funders will use data to inform how they work and the decisions they make. Sometimes those data will come from randomized studies; sometimes, they will come from performance evaluations and performance measurement to shape continual improvement and adaptation; sometimes they will come from case studies, a great vehicle for learning and improvement, especially in circumstances that require a good deal of professional judgment.

Part of increasing use of data and evaluation involves right-sizing evaluation efforts, focusing on what is important to know when, and anticipating how the results might be applied in practice. Paul Brest at the Hewlett Foundation has a saying, “Don’t kill what you cannot eat.” It is time to put our energy into consuming the information we take the time to gather, the reports we commission, and the studies we support. We should avoid asking for more data, more evaluation, and more analysis than we can actually make sense of in the time frames required for meaningful action. Rigorous methods are important, but the real gold standard is in use.

Hopeful signs ahead?

I invite readers to share what they believe are promising signs of more and better use of evaluation to inform decisions and adapt practice – in nonprofits, foundations, and even policy.

It is up to us…

Data don’t make decisions, people do. It is up to us to anticipate and meet our information needs, deploy ways to measure performance, commission evaluations with a clear purpose, and take time to use the data. Only then will we meet new vaunted expectations for a gold standard of data use.

 

¹In the interests of attribution, Jodi Nelson, my esteemed colleague from the Bill & Melinda Gates Foundation, was the first one to introduce me to this turn of phrase, “gold standard of use” and at the time, it was so resonant, I immediately said to her that I will use that phrase again.

 

Fay Twersky is a senior fellow at The William and Flora Hewlett Foundation and a member of CEP’s Advisory Board.

 

A Step Forward for Charting Impact

Wednesday, February 22nd, 2012

One of the most common questions I get from friends and acquaintances when I tell them I work at the Center for Effective Philanthropy is this one: “I was asked to make a donation to XYZ Charity. How do I know if they’re very good at what they do?” I point folks to Charity Navigator, now that they are expanding their ratings beyond just organizational financial measures, which I always thought was too reductionist. But they only review a small proportion of nonprofits. Guidestar can help, but for many organizations it is hard to make sense of what’s there unless you are a savvy reader of a Form 990 filing. We still need more efforts that provide helpful, easy to understand, and, ideally, comparative information about organizations’ goals, strategies, implementation, and performance measures.

So, of course, I jumped at the chance when I was asked to serve on the “Charting Impact” advisory group. Charting Impact, which was developed as a strategic alliance among Independent Sector, BBB Wise Giving Alliance, and GuideStar, is a common (read: comparable) presentation of answers to five “deceptively simple questions” that allow staff, boards, stakeholders, donors, volunteers, and others to access a quick perspective from organizational leadership about their goals, strategies, successes, and challenges. The reports are fairly quick to complete but they can provoke very important conversations among leaders of a nonprofit about what the organization is trying to accomplish – and how.

Additionally, Charting Impact reports are really quick for a reader to review. But they span a wide range of quality. For example, some participants skipped or were unable to answer basic questions about their capabilities for achieving their goals. That, of course, is precisely the point – allowing you as a viewer to get a better understanding of the quality of a participating organization’s thought process and work.

CEP’s Charting Impact report is on the site, and I encourage you to read it.

CEP’s President, Phil Buchanan, and others have in this blog written repeatedly about the substantial challenge of assessing and communicating about effectiveness and impact in the nonprofit sector. It’s a continuous refrain here in the CEP offices as we strive to identify new ways to help funders do so. Charting Impact’s efforts are a serious step forward in providing organizations a way to clearly and succinctly articulate their goals, strategies, and indicators of progress toward their objectives for the benefit of any interested audience.

There is no doubt in my mind that transparency of the nature encouraged by Charting Impact could help the public understand the distinct importance of the work being done in the nonprofit sector and distinguish among the differing levels of rigor and thoughtfulness. The project is still newly launched: as of February 22nd there are 115 reports on the site. Of those, very few are grantmaking foundations. Credit should be given to those funders, like the William and Flora Hewlett Foundation and the Tustin Community Foundation, who have taken the time to post their Charting Impact profile. I hope you’ll consider adding one for your organization.

 

Kevin Bolduc is Vice President – Assessment Tools at the Center for Effective Philanthropy.

Data Matters: Let’s Look Within To Find Our Exemplars

Friday, February 17th, 2012

The following blog post was originally produced for the NGen Fellows blog on the Independent Sector website.

When it comes to leadership, data matters. I’d argue you can’t be effective as a leader without understanding how to tap into data to gauge the effectiveness of your work and then drive (and inspire) improvement. So I am thrilled to see that the current class of NGen Fellows has decided to focus its project on “using data for leading the nonprofit sector.”

My hope, in reading the NGen Fellows blog post explaining this choice, is that we remember that this push is not new – that we have lots of examples to learn from. The Fellows write of the imperative “to keep up with our counterparts in the private and public sectors.” But even if the assumption is correct that the nonprofit sector lags in this area, and I am not sure it is (especially when you factor in how much more complex and difficult it is to obtain performance data in our sector), I don’t think comparing ourselves to other sectors is the most effective way to motivate change.

If what we want to do is inspire nonprofit leaders to use data more effectively to improve the impact of their work, we should look within the sector for examples. In that vein, I was pleased to see that one of the project goals is to “showcase organizations effectively utilizing data to achieve more efficient operations and successful outcomes.” This should not be hard because, while we can all agree that there should be more, the fact is, examples abound.

  • Why not look to the example of the Institute for Healthcare Improvement, a nonprofit whose savvy use and analysis of data has helped dramatically improve patient outcomes?
  • Why don’t we study organizations like Nurse-Family Partnership, which has developed powerful data on its efficacy?
  • Or how about the work of the Stuart Foundation and its grantees to improve life outcomes for foster kids? (Disclosure: Stuart is both a grant funder and client of the organization I lead.)

Wharton professor Peter Fader, who is Co-Director of the Wharton Customer Analytics Initiative, has observed that nonprofits often excel in using “their data to better understand their ‘customer base,’” arguing that, “in this area, big companies with lots of resources really can learn from their cash-strapped non-profit cousins.”

I think we do ourselves a disservice by painting with too broad a brush when it comes to the state of the sector in its approach to data. We need to understand both the historical and present-day examples from within the sector of rigorous, data-driven, effective nonprofit work. And, as a former strategy consultant to major companies, I can tell you there’s a lot of shoddy use, and non-use, of data in the corporate world. (Financial crisis, anyone?)

At the (nonprofit) Center for Effective Philanthropy, where I serve as president, we have developed tools to allow foundations to get confidential, comparative feedback about their performance from grant recipients and others. (Our motto is “Better Data. Better Decisions. Better Philanthropy.”) People widely assume that we used customer-satisfaction surveys in the corporate world as our model, but we did not; our model, in fact, was the comparative reports based on student survey results put together for decades by a consortium of nonprofit colleges and universities.

If we are to inspire more leaders in the nonprofit sector to embrace the tough work of being data-driven leaders, we will be much better off looking for models within the sector. My hope is that the NGen Fellows will keep that in mind as they pursue this important project.

 

Phil Buchanan is President of the Center for Effective Philanthropy.

Getting to Transparency that Matters

Wednesday, February 8th, 2012

On December 6, 2011, the Center for Effective Philanthropy hosted a panel discussion titled Power & Light: Grappling with Transparency and Effectiveness in San Francisco. Presented in collaboration with Foundation Center and sponsored by the Irvine Foundation, the event featured Phil Buchanan, president of CEP; Emmett D. Carson, CEO and president of the Silicon Valley Community Foundation; Christy Pichel, president of the Stuart Foundation; and Bradford Smith, president of Foundation Center.

The panel explored the tensions that arise as foundations attempt to be both more transparent and more effective about achieving impact. Here is some of what Phil had to say over the course of the conversation.

As Phil mentioned in a blog post shortly after the event, foundations have made progress when it comes to transparency. Still, Phil argues, there is a long, long way to go to raise the bar on what constitutes transparency that matters.

You can view a full recording of the Power & Light event on the CEP YouTube page.

 

Stephen Sullivan is Senior Coordinator – Communications & Programming at the Center for Effective Philanthropy.

 

Data Point: Is Evaluation Resulting in Meaningful Insight for Foundations?

Friday, January 20th, 2012

For our recent State of Foundation Performance Assessment report, we gathered data from 173 CEOs of U.S. foundations with annual grantmaking of at least $5 million on the extent to which they conduct formal evaluations of their programmatic work. The data indicate that evaluation remains an essential component of how foundations approach assessment. More than 90 percent of CEOs report that their foundations conduct formal evaluations of their work, and a majority turns to third parties to conduct that work.

A large majority of CEOs report that formal evaluations have helped their foundations understand the effects of their programmatic work. Yet using evaluation well is not easy: fully 65 percent report that having evaluations result in meaningful insights for the foundation is a challenge.

In terms of resources, how much are foundations putting into evaluation? Most are conducting formal evaluations for half or fewer of their grants. The median spending on formal evaluation is two percent of a grantmaking budget.

 

* * * * * * * *

To read about current foundation CEOs’ attitudes toward assessment and what foundations are doing to understand their performance, see the report, The State of Foundation Performance Assessment: A Survey of Foundation CEOs written by Ellie Buteau, Ph.D. and Phil Buchanan and published by the Center for Effective Philanthropy.

Ellie Buteau is Vice President – Research at the Center for Effective Philanthropy.