Skip to main content

tv   Key Capitol Hill Hearings  CSPAN  September 24, 2015 12:00am-2:01am EDT

12:00 am
through hundreds of pages of documentation. that's a big challenge. let me talk about the census bureau. that is the major source of our information about our people. this is where good data comes from again. they care about everybody in our country. when you see data from a lot of these private-sector sources, you always have to question, how representative is that data of the country? if someone is in public polling, if you have a pollster who doesn't have access or can't use cell phones which is a big issue, those numbers can be quite skewed. think about the last election. there were a lot of polls that were actually off. when you're actually off. when you're looking at data about people and you're looking at it from these private-sector sources, there's a huge question about the quality of this data.
12:01 am
what can we do to make the data easier to find. if you want to know what's happening to your community, what does your community look like? let's say you're moving to the d.c. area. what does the smallest church look like? how does that compare. what are the people like question my two they have characteristics you are looking for question you should be able to find out but it can be really hard. >> then we have the patent trademark office. when you're an inventor, you have to look at the patent database. right now now a lot of that data is very unstructured. it's not machine-readable and not all the data that the patent office has is out to the public. there's an analysis at the
12:02 am
trademark office where they are sitting on a bunch of data that hasn't even been opened yet. they all have very common themes. if you want to get the data out, there is people who find it and can use it. let's talk about the analysis of data. again, i mentioned one mentioned one of the biggest constraints we have in the country and for those of you in the private sector you might have a hard time finding these people who have the skills that looking at this data. so we looked at that and there's a lot of people in our society, over 10 million who are kind of data in their day-to-day jobs. 10 million are very data intensive. we expect that number to grow over time. we need to develop more people with these skills who can actually look at data and make
12:03 am
the right inference from it. look at these big data sets it goes well beyond excel. i know the department of commerce, finding people who had the ability to take large data sets and do something with it, that was really hard to do. when i talked to my friends, salaries are really high for this. we have to do a better job of educating people to get them into the pipeline to be able to do this type of stuff. as i said, smarter government. we could talk about smarter government all day. if i'm speaking from an economic point of view about where we are really going to move the needle, where data can help, it really helps where there is a lot of uncertainty and we don't know stuff. we have all been, had experiences for the healthcare sector are self. sharing information across the healthcare industry is very difficult. precision medicine is impeded by the ability to share information
12:04 am
about our dna, for instance. this is an area that is ripe for huge improvement because of data healthcare is about 20% of gdp. this is huge. if we can approve health care just a little bit, we can make it more efficient, that could have huge benefits to our economy. talking about criminal justice, there is just so much we don't know about the criminal justice system. you think about all these financial records across all these local governments and how they don't talk to each other. if you think about all your state and local law enforcement agencies, even within those agencies, data doesn't talk to each other. my car was stolen july 3. that's how i celebrated july 4. my car was was stolen outside my house. i called the police and they
12:05 am
come. have you ever had a car stolen before question mark. >> the police assume he just forgot where you parked it. once you get over that and you tell them you weren't drinking too much the night before and they actually drive around your neighborhood to see if they can find it, my car was actually stolen. then they say well someone probably took it and was out joyriding and it will turn up in a couple months. i talk to my neighbor and they said my car was stolen too. i went to the d.c. government website. one part of d.c. that works really, really well is parking tickets. right? [laughter] >> it is really efficient. what they set as you should go on the website a couple times a day and see if your car gets a parking ticket and if it got a parking ticket you can find it. sure enough it did. it got a ticket and told me
12:06 am
exactly where the car was. i went there in the car had been moved. but the parking folks do not talk to the police department. the person issuing the ticket had no idea that the ticket they were issuing to was a stolen car. that's an example of how i think the criminal justice system, where you have separate data systems that aren't talking to each other at all. there is so much room for improvement. then if you think about all these questions about incarceration, what are laos should be for certain valuations it's amazing we are some making such profound decisions about people's lives in an area we just don't know much about. let me talk about one other example of merging data.
12:07 am
one of the last things i was able to do before i left office was start this process of merging data on our veterans with data on employment outcomes. why do we care about that? we really want to know what happens to veterans when they enter the workforce. we want to know how that varies and how many tours of duty they did. how long were they in the service? what did they do in the service? we want to know the relationships between all these things and what veterans program they received. they don't know how efficient, or what the outcomes of are of all these programs. we just don't know very much about all these other programs.
12:08 am
maybe they're all great. maybe they should be funded more. maybe someone having the outcomes we would want. we just don't know. i merging these data sets together we could figure it out. we look at the u.s. economy that has been growing, one of the really big questions we have out there is productivity growth. you probably don't think about productivity growth but if you think about how fast our economy grows, that is how fast our jobs grow and how efficient our employment is. why are we growing faster in this data revolution where we
12:09 am
hear all these great things about dataquest month there's this big conundrum there. maybe what it is as we have all these businesses gathering the information but they haven't yet materialized the benefits from this data from a macro perspective. perspective. that is a huge huge question. so all this data stuff, how much can this improve the u.s. economy? i'm just in a throw out a couple rough numbers. there are studies out there that always talk about trillions of dollars and billions of dollars, i always find those numbers hard to understand so i can't understand a trillion dollars. so you have all these numbers and let me put them into context. if you think about all this bad stuff, if you think weakening
12:10 am
proof by 1%, think about all the improvements would get in government and the private sector. 1% is an very much, is it? an very much, is it? 1% of gdp is $175 billion. that's hard to relate to because it's such a big number. that's $543 per person. that's about $1300 per household. that's a lot. the median income is about $52,000. that would be a nice bump. that's if we could just improve the economy one percentage if we could just improve the economy one percentage point in the state of data revolution. that's in a conservative estimate. if it improves by 5%, that's over $2700 per person. there's about 2.4 people in the household and now all of a sudden you're talking over $6000 per household increase.
12:11 am
that's why everything that you are doing and everybody else is doing is just so important because the better information that we have in the better we can analyze it, the better decisions we can make and get better outcomes we get better government because our businesses are more effective and those businesses thrive. then mourn portly, our citizens become more informed. i'm not sure what dollar amount you put on that, but that is important as well so we are at this tipping point and we have more and more data and more more groups like this. we have more data that is assessable and usable and actionable. how quickly we reap these benefits and the benefits are
12:12 am
huge. it can make a huge improvement in the quality of lives and hundreds of millions of people who live here. we have to make our data more and more assessable and we also have to, one of the biggest constraints we are facing, when i talk to people in private sector, is we have to invest as a country into the skills so that we can take advantage and leverage this kind of data revolution. with that, thank you very much. [applause]. >> thank you mark. we have time for about ten minutes of questions. i think i will start out with the first one. you just finished six years in government. you are advocating for fact-based decisions and the advocate to see of the use of
12:13 am
big data. >> i don't work for government anymore so i don't care, but one thing i am doing is writing a book and i would like to talk about all these different areas where data can really improve the quality of our lives and really improve our country. what i noticed across all of these areas where they're talking about accessibility of data and the government spending and healthcare data and veteran data, there are all these common challenges. how do do you get all the stated together while maintaining privacy? on the one hand we have this ability to take all this data of many different aspects of our
12:14 am
lives, combine it so we can really answer these questions. again just think about the veterans example. how can we do that while maintaining privacy or the perception of privacy. the american public is getting very concerned about information that the government has in the private sector. we want to use this data on people for good. so if i have data on veterans, for instance, if i could combine it with labor market outcome and look at their credit scores and look at how much debt they have, are they making their mortgage payments? i could better design veterans policies. i could better design programs why while they are in the
12:15 am
department of defense so that they have better options when they leave the defense department. to do that, i would have to combine data from a lot of different sources. our society has gotten into this big question about how do you do that while maintaining privacy and this perception of privacy. on one hand people give a lot of their private data to private companies. facebook knows a lot about me. facebook provides me a service in return for that. when it comes to the government doing this, or even the private sector getting more and more data on us, there is this real fear. i think it's an interesting balance. on the one hand, the more data we have, the better decisions we can make. the more data we have, there's a heightened anxiety of people. how do we have this conversation with people in order to do this? this is something i would like to really work on in some capacity at some point because again, i really do think that if we can leverage all the information out there, we can really move the social needle quite a bit. >> any questions out there? >> my question goes back to your
12:16 am
model. one of the things i noticed that was a little bit absent from that model was the very beginning, the visioning on how data is examined or collected. to the extent that data analysis and data decision-making is a science, to what extent does there have to be active manipulation to get the type of data you need to make the decisions you want and to make sure those decisions are the right decisions but mark. >> that's a next line west appeared when it comes to hypothesized testing, that is is really hard. when i was talking about experts in the field, you are going to find lots of correlations and patterns. there's a big phrase we've all had which correlation does not
12:17 am
equal causation. there's lots of examples of that. i think what will happen is, you've seen it before in slow-motion, the process goes back and forth as an iterative process. you look at the data and say what hypotheses can i look at with this data. then you say what data do i want. then you go collect that data. anyone who has done big data science before says wow, that was really cool. then the number of questions begins to multiply even more than the data sets themselves. as a country i think we have to be more adept that saying this is the information we have and this is what we can glean from it. what data should we be gathering? we need to to make sure that it goes both ways from hypothesis to data and then back from data to hypotheses. >> i think you had a lot of very
12:18 am
good points. you said 91,000 local governments and the potential to impact every citizen in the country. usually when we talk about data we talk about the national level on very broad scales. what are your scales. what are your opinions of making this a way where every citizen or business can consider how they change their processes or policies and making data assessable and open data analysis to improve their analysis. >> when it comes to local government, i think we have some groups over here that represent local government. we traveled around and spoke to lots of government organizations as one of the previous speaker said, some of the big cities, for instance released data in a pretty good way. so there is a company or to to
12:19 am
and they think what data do people want? they had this active community of, what you call them? they would bring in these people to the city of chicago and say here is our data, do something something with it. i think what they found is sometimes there are data sets that people say there's not much use for that. then there are data sets that they found really interesting. so, for instance, one example of local government in chicago is
12:20 am
if you take a picture of a pothole, they send it into the public works department of chicago mma posted on the department of chicago's website. then they are accountable for filling that pothole in a timely manner in a timely manner because everybody knows that pothole was posted onto the website. i thought that was a great example of making information available and then making them accountable. the challenge we face, as you mentioned, is we have so many local governments. a lot of these governments don't have the resources to do this kind of stuff. you're talking about data science and making data open and data analysis. a lot of these local governments, if you look at them most recent recession, one of the biggest drags we had as the economy coming out of the state and local sector. it just plummeted in the state and local
12:21 am
sector because of housing prices and property values. as they went down and tax revenues went down, so what's happening at the state and local government is you have some that are really working on it and some aren't. maybe what we really need our standard best practices across all these local governments. i think now that we've been doing it pretty well in the larger entities for couple years, they are beginning to learn what these best practices are. i've spoken to many of these local government organizations and they get this, but at the state and local government level, it is the human capital constraint that they are really facing. >> i'm the director of
12:22 am
intergovernmental relations. the question i have in the human capital aspect that came up in the last question is relevant to this, all this data stuff, as you mentioned is taking place at the same time where people are getting frustrated with the cost of higher education. at the same time we need people trained in this data stuff. what implications do do you think that has for our educational system? what's the best way to educate people who are knowledgeable on the data stuff? >> one thing i've really pushed on his the community college system. if you look at community college systems across the country, i think they are doing a better and better job working with local businesses to match the
12:23 am
skills of workers. when you survey businesses, which you often see is there is a skills mismatch issue. we can't find people with the right skills. how many many people are in that mismatch category? it's really in the millions. we are thinking about getting these skills to do this data stuff, we have to think outside the traditional four-year college degree. also, society always pushes lifelong learning. how do we learn the skills? i'm teaching myself the data processing language. what i'm surprised that is the number of online courses that don't have anything for me to learn this. it requires a certain amount of dedication and i have to know a lot of people who know a lot about the stuff so i'm pointed in the right direction.
12:24 am
fundamentally, we know the cost of higher education has far outpaced the other part of the economy. to answer your question, i think the think the community college system is really huge. in your first year now, you take a computer programming course in many places. it's a way to think. this is something that everyone should be familiar with. when i was in high school, a long time ago, i took fortran. that was the exception not the rule back then. what can we do to teach kids today things about coding? again when i was under secretary, there are a lot of camps work kids would learn java and then they would make an app. that makes it really fun. i think the way computer science used to be taught was done in a
12:25 am
really nerdy way. it it was done in an exclusive way. not only is it where you get your education but it's how the stuff is taught which i thing is really fascinating as well. >> thank you. that was a a great question to wrap up our keynote. thank you very much mark doms. [applause]. now more from the data transparency : conference with darrell. as part of the conference it is just under an hour.hfeaç >> hello everyone, thank you for being here and braving the traffic in support of the open of structured data.
12:26 am
you really don't want to hear me talk as you do the man on my left. i will be quick. research data group provides compliance services and software tools to public companies to help them communicate with investors and comply with regulations with greater ease. we've been in this industry for nearly 30 years. we are excited to see all the changes that have taken place recently. our country is finally pushing forward with real changes. now if we now if we can just be more effective and efficient. those words normally do not correspond with government but we are finally making the move in the right direction. the single most important change in the past ten years has been the passing of the data act. we are very, very enthusiastic. we understand that opening up government data can help everyone in the country. not just idealistically but it is going to be in a very practical sense, the organ asic
12:27 am
organization that links organizations and people make better decisions in all aspects. many people have a hard time understanding how large and undertaking the transformation from static documents to searchable data is. our speaker for this panel is representative darrell. he knows this cannot be achieved by one team or one person or one agency. it requires a concerted effort from many individuals. he also knows there is more to be done and more leadership that's needed to really transform this data rather revolution. without further ado.
12:28 am
>> with that kind of intraday introduction, i should just take the applause and leave. first of first of all, thank you very much. the one name you didn't mention is without partners on the hill, things don't happen. my partner in the data act in the senate was senator warner. i think it is extremely important to understand, he was the one that went to harry reid and demanded the bill be moved. although senator reid, ultimately we made law together. the data act is just as it said, a major piece of legislation but it is just a start. you can write legislation but unless you oversee it and implement it and are diligent day after day, it will be
12:29 am
meaningless. today there are many cios who are not confident nor do they have the financial controls, the budget controls, of their project. that's another major stumbling block. it doesn't mean there hasn't been a law passed, it means we in fact have to stay on top of it and we have partners in that effort. i think most congressmen have one thing they can do very, very well and that is they can talk about their next piece of legislation. i want to get that out of the way without any mystery. the financial transparency act, obviously law, the next steps are to insist that we make all data in government just as good and just as available. some of it is hard. just before i was coming up,
12:30 am
they started saying, what, what happens if you're taking a picture of a pothole? how do you make a picture of a pothole on a machine searchable? if you are using a camera that is modern, you will have the gps location. you will have the time and date. you will have rich metadata and if not lost, it does make a unique location for that unique picture for that unique time extremely valuable and searchable. it may may not tell you why it was taken or whether it has been fixed, but at least it's a start. i want to mention one thing that i have a passion for and that is modernizing. the data act is a standard that helps a tool. the freedom of information act is, today, in my opinion, a great success but is a fraction of what it was intended to be
12:31 am
and what it could be. every day, countless individuals, companies, news organizations and law firms try to receive information. the first thing that happens is it goes to a human being who begins a search process who then begins looking through the data in order to redact information that will not be given. literally a human nightmare to be able to do it. under the data act, we act, we envision metadata will be so easily searched that when you're looking for, you won't even have to ask because the vast majority of information will already be available online, with appropriate identifiable information and other fields that have been predetermined.
12:32 am
it will be limited to, i looked at the data, it indicates something more and i believe i have a right to some portion of what is redacted. knowing what you're asking for and cutting down the number of requester magically because the majority of what you want is already there, available, online and searchable and to be developed is a good start to making government open and transparent. i think one of the most important things i can do at my age is to tell the young people in the room how we got here. why we shouldn't be here but why it was logical some how to get here. nearly 40 years ago, i ran my first computer program. well i ran part of it until the card popped up saying i had a flaw in my program.
12:33 am
yeah, giggles in the back, you haven't held a stack of cards with three failures in it and only one gets shown because you have to run it again before you find the next mistake line by line by line. in those days, days, we all understood that each card was simply more or less of zero and a one. everything was. he data and we were turning it into something. over the next few years, we turned computer programs into devices that could be run for all areas. you could bypass one error and go on. we also began also began printing out massive amounts of ascii characters on printers. absolutely useful information unless you read it. behind that, if you had an index, you could find out anything you wanted to find out about the data you were building. at that moment, whether it was a
12:34 am
deck or a digital corporation or an ibm or an hp or a myriad of other companies, many of whom are not here today, had we said we have the beginning of metadata, we have what we index and let store in those characters, we would have been fine. but we didn't do it. what we did was we continued along with proprietary indexing. proprietary calls that were embedded with no standard. many organizations over the next many decades built standard after standard after standard that were, well you can have as many standards as you want and everyone picked a different one. today we know that we can build standards that everyone can use or export two or make available and still maintain their proprietary calls. that's the future that is now with us.
12:35 am
one of the questions is, how do do we get from a law implementation, and there are really three components to it. one component is very, very clearly public demand. the public has to look at the benefit they get from open data. all of us who know and can know where our airplane flight is coming in or even while you're on the airplane, find out exactly where you are, are benefiting from data that has been made open for the application industry. all of us who have an app, everyone in the room has a weather app somewhere. you only use it when you worry, but it is there all the time. again, data made open imagine if all the spending of government, all the vendors were made open
12:36 am
and available for nonclassified work. imagine how quickly you could find out that the government, through no fault of its own is ten different prices for the same product. in fact they may buy once from the company that manufactures it, once from a distributor and several times from retailers and not even be aware that the way they went out for contracts they did that. imagine how how much savings we would have. let's also imagine a world in which government stops. i said there were three parts. government stops making that progress and goes back. what do we do about it? is it natural is it natural for vendors to say, the data act is great but it might hurt my particular revenue stream downstream so i'm not going to do it? in less the executive branch says no, we need it. , we are not looking for open software but we are looking for open data and we insist on it.
12:37 am
imagine as government goes from one program to one person to one time to another, congress simply closes their eyes and said we passed that law, were good. i think you can quickly imagine that if congress takes its eyes off the oversight than the weeks and the months and the years and the decades will go by and we will still have legacy programs and post legacy programs and post post legacy programs. such as the programs in the 60s that the irs claims they are still using with computers from the 60s. we can still have that and pay a huge price for that. under the data act, the government has huge responsibility. treasury has has huge opportunity, but when i said that it's public, the executive branch in congress would have the primary responsibility, i should have
12:38 am
said the public must demand in the public must continue to demand, it's the public that must ask why not because the only way to get the executive branch to stay on it is for it to be important in a political sense. the the only way for congress to stay on it is for it to be meaningful at organizations like this that are dedicated to it. i charge all of you, we passed the law, i intend for the rest my career to stay on top of it for the best of my ability every day, but you have an opportunity and others are doing it right now. every time you build an app or try to build an app to take advantage of data that's being available, market to the public the benefit of rich data sets and your frustrations need to be communicated in three ways. to the executive branch, your
12:39 am
congress one might say i'm here today, that was a good line, i met met i'm the congress for today, and lastly you have to communicate it to be public. do not go quietly into its going to happen next week, next month, next year., next year. it's not in the budget this year. if you have a success market it to the public and say it's because of open data. if you're being thwarted or delayed make sure you go just as public with it. ultimately, somewhere there is some bureaucrat, bless their heart, i was a bless their heart when i don't mean it, and they are just a matter of weeks or months or years from retirement and they just don't want to have that challenge. the people that work for them and come "after words" want it. for all the young energetic government workers who want to be thought of as the leading edge of technology, make sure
12:40 am
you go public there somebody there that is looking and saying that will happen on the next person's watch because i only have three years left till retirement and that's too much hassle. you notice i didn't mention government contractors. i didn't didn't do so because my assumption is contractors do what is important and what is put into the bids. one of of the areas that i'm working with other members of congress is to ensure that congress begins pushing the executive branch to make sure that it's in the bid and there is a benefit in the bid. the old some contractor or contractor for the government or government agency directly should beerni modng a program and have an incentive to take us from where we've been to where we need to be. that will be a monumental change. on-time, on budget of course is
12:41 am
important. on-time, on budget and saving the american people billions of dollars over the next decade by opening up data has got to be in the bid. you won't see at the day the software is delivered but you will see it for generations to come. i just want to make sure i didn't miss anything. in closing, we are just starting. in closing this is the opening round for open data. there are companies that will take advantage of it and make fortunes. there are nonprofits who will take advantage of it and embarrass people in the administration, not just in this administration but the next one in the net one after. if i have
12:42 am
my way, the same level of open data will be great for the public and questionable for congress as it gets opened in all the branches. i want to close by saying, this is the start. i'm delighted you are here and there is in fact a coalition that dedicates itself to the same thing senator warren and i were honored to be able to start. the conference is a delight delight to attend. i look forward to next year having a list of accomplishments because i believe this presidency, which was promised to be the most open and transparent does have an opportunity to show that it can open up government in the areas that are least understood and least transparent and do it before the lights go off for this administration. i think they will. i think they have set a course, they have appointed good people and now the question is where
12:43 am
will we hold them to a timeline that is the same as the timeline for the president. if any timeline is offered to you this year, if it's one day after january 20, 2017, then it's not a timeline, it's a dream. we don't need dreams or promises. we need what will you deliver before january 20, 2017. pink you very much for being here. >> we will take some questions from the audience. >> one of the requirements of the data act is that they have to get their reporting and recipients have to report back. what do you see as the biggest obstacle in achieving that and
12:44 am
getting current recipients to report? >> that is a great question. grant recipients was one of the reason it passed out of the house in a bipartisan basis and sat for two years. there are two reasons there is an obstacle. one i can understand. your university professor your university professor and getting a couple million dollar every so many years. you are used to loosely living up to the grant but perhaps, hiring an administrative assistant here or there that only loosely work on the program. the data act is intended to really follow the money and see whether it is auditable as being spent appropriately to whatever the grant was for and we think that's important and those who shy away from it often do so because it's nice to get a pot of money. i don't want to say laws were broken or anything else but co-
12:45 am
mingling and moving around of money has gone on in universities since the ncr 500 and me and university. that's one challenge. the other challenges we in government, we write grants and take applications and until the administration realizes that, no one should have to enter data twice. every entity should have a number and once it has a number it is personal identifiable information. it is single database should be there. just like most of us when we login, we expect we expect to login and it doesn't matter all matter if it's the cloud with google or our local device. we want to login and have it say hi darryl and have all kinds of information there so we don't have to enter it twice. if you assume, for example at
12:46 am
the university of4h information is automatically populated, it's not asking you information but asking you to fact check what comes up, then you see a reason for for this information delivered this way to be valuable. that is government's responsibility. live up to the dream that you shouldn't have to enter, again and again and again, even if it's at a different agency, the exact same information. allow that information to be valuable to the university that is overseeing thousands of dollars worth of grants. :
12:47 am
>> >> >> don't really understand to the piece of data looks like so they take the obvious things like a more social security number but there are a lot of other things that are not as obvious like more data released for more and more resources we could take those datasets to run them together to figure out who is to order if things are mashed incorrectly like the
12:48 am
states had earlier so my question is as the open beta advocate to create a political backlash so what policies are put in place to help these agencies educate themselves so they dulce dated to buy it themselves in the ass later on. >> and bass is a technical term? left last. >> that is a great challenge of the metedata not properly define the federal government has an endless amount of history of how we define what did name is lower social security number but programs have been written without compliant metedata identifiers and that is a matter of saying here is a federal standard
12:49 am
can the data be searched? the last figure what to do is if you have five spreadsheets with day different name the bill one that and don't need it but government agencies need to put their data in a format but having said that your challenge as a software developer -- developer, and the post office happens to have of great program that almost works. come on. if you take your data set because it is to enter name and address and zip code you give them the data they have a wonderful program that actually corrects almost
12:50 am
everything. it will change your abbreviation from avenue and st. to make it compliant and as does the code plus and that physiologic that it takes is pretty amazing to take bad typo's with data entry for names and addresses to make it right. software for the interim will have to do a lot of that with government data. for the short-term the will get a lot of data that somebody imbedded then they must second or third time without a field in decatur will take some cleanup. that is one of the opportunities for software companies if they're working with the federal government on compliance to scrub the existing data to apply
12:51 am
metedata so it doesn't have to be further scrubbed in the future and although there has to be funding from congress but those earmarks to get data that you should not be in fear of litigation because the government should scrub so it doesn't happen if the post office can be a part of the solution then you should not have to worry about that be meanwhile reasonably scrubbed. >> i think the data act gives a huge amount of valuable data but i would like to look across to say
12:52 am
where is the spending on the save program or the save activity? or on this a mission? not because they care about spending but to find out duplication or the need to coordinate. is there an interest or commitment to take further choose say agencies are required to identify the program but one mint may save water quality safety another may say water quality research will there be an opportunity to use those cases i have described ? >> the answer is yes if they get cooperation from the administration we should not need a new law. the omb setting or requiring all the agencies set a
12:53 am
common standard can do all lots of this. years ago i had a simple task. i just wanted to know the jet aircraft the models and where were they and who owned them. i was a junior member sarah they laughed at me. as the we are -- the years went on i kept asking. it is amazing even the department of defense has a whole a joint group with bolts of officers trying to figure out where all the aircraft are and who is controlling them. that is simple. you spend a few million dollars there is only so many, maybe 1,000 non-combat fighter aircraft. it should not be that hard
12:54 am
to figure out if you wanted that information to date it would cost the government a fortune because the coast guard has there's a and this has one. you are exactly right. interoperable standards the that is the same thing to be named to the same way that it has a number identifiable so not only find exact matches but even with a characteristic difference a unique metedata identifier and that is what we're getting to is a standard setting for what you call something if it is identical or a difference and omb has a responsibility to build interagency cooperation to get that. we're hoping they will be one of the first because if i have a caterpillar tractor
12:55 am
in the army navy air force marines and i need a part it is somewhere under an agency the lasting a want to have that down while somebody is waiting if we already own it but is going to disposal. that happened today. it cost us millions. if you run out of questions i have a second speech. [laughter] >> em a consultant with deloitte i never heard you talk about the data act to support the concept of establishing a common language. i have a powerful concept how that might impact congress' ability to support its function in its capacity to represent people.
12:56 am
>> alloy answer the question as i interpret it. i think i heard you say what will happen if it is fully implemented to where all data can be made meaningful by searching multiple databases like google going out to every newspaper what they know about their congressman and. that question in the american people have is if the money is well spent at a particular veterans administration center is based on a shortfall of doctors with the efficiency within the hospital. questions like that, in a democracy, our public is a
12:57 am
representative democracy but the ability of every individual to know more before they ask a question of their congressman or woman to get information directly can have a dramatic effect that would love nothing more than for my constituents who were waiting for service to know though wait times or the ratios with the amount of care to say my hospital and is underperforming and as a result i am not getting care. what's wrong? that kind of a question is so much more powerful and really try to get them in faster. i see that as empowering to members to produce a lot more constituent requests
12:58 am
but that is very targeted because the information is already cleaned by the constituent or a case worker at a computer screen than waiting but much of this would only happen if a software industry supporting this open data supports those tools i have no illusions that one of my case workers there is no way she will develop a program to search the database but it will be made available if the data is searchable i don't care i just wanted available to my case worker and 534 other case workers.
12:59 am
>> line of the things with implementation and the heavy lifting is that the agency level. as we've talked with agencies they have different sentiments of the data act themselves to value the analysis and analytics. covering their needs for standards. to agencies have the capacity to fully implement the act? so in casual conversation that they don't have access
1:00 am
to some of the data so what whoof you say about that capacity issue? >> i missed hearing that. >> i don't currently work for one i have in the past. >> accused said they didn't have the ability. the the answers to a question in light that quite frankly is the leadership question from the of white house. to have the ability if the omb leaves on behalf of the
1:01 am
president to say we need a plan from every agency. what are ur your short-term with that space being true to what portion is already in a format to be easily made available? what guidance do you need to set those metedata standards? some have been asked from those to do some of this bill you are right to it is typical for an agency to say when the process goes
1:02 am
forward to give guidance look at the sec i can never pronounce the word that says that they just don't want to do which. but quite frankly some of the agency's are far along to having their data in the right format in not providing it in some cases. this is where the leadership ted is important. with that determination what they need to do and it has
1:03 am
to be implemented that is why we have people better called the legislative appointees. >> than to meet with you anytime. but intel day tell us what we need when you hear that from the agency the only question is what have you done to find out what your capabilities are? you are told where you have to end up. i would propose to all of you if the agency at some
1:04 am
level king get a piece of the information in their proper format where they note the information and when day get done with their system to deliver on a spree nor a piece of paper coming to export to xl, if you can do that with a and agencies already have everything except metedata aside and to those fields of making a interoperable. you are already mostly there. within a a program almost everybody has everything they need. with identifiers and vetted. if every agency were to publish their keys compared
1:05 am
to another agency's if we do at one time then to add another number that is a common number throughout government now you never have to do would again. that is the magic's to let get the ambiguities and compare and fix them now you don't have to spend $100,000 every time congress who asks for a report because the common unique and number for the event is simply a matter to say i know that you all made up numbers now we will not make it up and assign a
1:06 am
common number. at the local level it cannot be done that with leadership fell local level can prepare the information for the process to do its job per cry saw some ahead shakings my baby rights on the one part of technology. anybody who comes into talks about the early computers lose is the audience immediately except the gray hair but mine began in the '70s with me looking at everything that we cannot do it with a few bits of data that we imagined that we could have now free just take the time to imagine it. >> said cheap seats ask
1:07 am
great question as but they are shy but of french is the great questions. >> i hope i live up to that. i work for the association of government accountants. this question probably will not surprise you but i am kerry is in terms of the accuracy if the analytics don't work if the data is accurate to begin with. one of the things we have been kicking around is the management certification issues so when they submit the data into the system they have to sign a statement tuesday to the best of their knowledge the information is correct. is said alogical approach if not what is a good approach to make data more accurate?
1:08 am
>> i like yours. as someone who's headed a company and to get public as a board member i am acutely aware i have been increased the pay of my cfo when he and the cl had to sign the audited financial statements. it is amazing repast sorbets oxalate to require both to sign and you could go to space jail europe personal accumulation of treasurer could be taken away if it is false to make an assumption of reasonable due diligence to know that they are correct not just i don't know. that standard is perfectly
1:09 am
reasonable. but the question is is that coa equivalent that is something the government has to work out. that those who decided they would bounces around but the smart people refuse. [laughter] by then somebody didn't know so that the people he knew
1:10 am
better were sidelined in and we have to do do it is passed to work or airman and barris the president is not have all but will be. rehab fatuities chairman at the start who will sign that an mp in a position they have sufficient to transparency that is the case. there is the right way to june but but years ago but
1:11 am
the chairman comes out to be with the most popular ideas to show us where the money is being spent. of course, it was the disaster for political purposes the administration wanted to show how much money in every congressional district in the first week they had a heyday i thinking california was 58 or '01 40th avenue york they corresponded with nothing and it was wonderful because the public saw it and said this is absurd and it was corrected. so working with those involved jihadist system because of its transparency and the data was open and available for all the flaws they were corrected, mostly
1:12 am
because we had 300 million people who had a vested interest to sometimes mocked that i feet that is the greatest success because it kiam be fixed built to be open and transparent a anjous search for data in the $800 million program. i may not have liked it but at least read know where the money was spent and software was a good basis for us demanding all of government do it and hence the data act >> the last question was so cool i hope you can live up to it. i am a the co-founder of the woman i t firm focusing on development we do a lot with
1:13 am
usaid. so how to use the and what can the government do to encourage partners to our less they and transparent to change their mind? >> guys said bless you for a reason. i will tell you how big the problem is then give you a partial solution. early august only a few weeks before the new president had come to the white house and treasury had briefed job the fact the minister of energy had
1:14 am
stolen half a billion dollars essentially every month for more than five years. the arithmetic half a billion a month every month is pretty amazing that a great deal is real-estate so the treasury says if you charge her with the cry we can freeze the assets to get your government-backed this large amount of money. she has not been charged because they're stealing just as much. though whole game in nigeria is so corrupt everyone every level steals. and if you say anything but
1:15 am
but that love all of corruption is says anti-week give them invade every year but is around the world including the united states but people are suffering. but i kidnappings heroes in a rand ransom but some are so tough that you cannot fix them. on the other hand, if you want to put on mayor you can look yancey in tire government spending on line date got reducted of cash payments and getting rid of a ghost employees 10 years ago. the fact is that you do the
1:16 am
best you can with the country that you have. do we give half a billion dollars for a rich country stealing for itself for not? butted nigeria's that decision is really giving money to a hopeless situation as much as to the bike to help that government has the ability but in that got has the ability but in the case of other areas the government says i need the help said new make the standards but the ngo are not being held up but to deliver services knowing that there is a lot siphoned off but in many countries that the data act it can
1:17 am
help you see the performance of the ngo. a can follow some of the of money in the banking system. we can do a lot to inform the department of state. item number we can make that decision led to redo in afghanistan? because every one of those countries is a different challenge and a good data is only as good as the first dollar spent. but the vast majority are honest and those that are not they're telling you loud and clear they're stealing. and i work this a lot it is a matter that if there is a
1:18 am
real accusation that they are getting ripped off is a management decision. that they're getting ripped off by others, it's a management decision. if you give the inspector general direct access to the data as it is, in most cases, they can find it and they can disqualify those ngos. that means more money arrives where it's supposed to be. again, i can sell the data act for what it can do, but it's only going to be as good as the transparency to the public who insists on their money being well spent or the agencies who now have better transparency making good management decisions. i don't want to go any further because i've stated as hard as i can on that particular issue. what's your closing closing question? can i leave now? [laughter] >> we have a networking break. >> thank you.
1:19 am
[inaudible conversations]
1:20 am
that is equivalent to representing 20 percent of gdp with one set of use to navigate from the highest level to the lowest level. that perspective will be
1:21 am
very powerful asia and transformational have retaken of allocation of resources. the data act implementation is not the only open data initiative in our government today it is publishing tens of thousands of new data sets and of the panel hopes to leave the executive branch and the open data initiatives and with the digital world to help clients to be suggested and service oriented with capabilities to unlock the value of the data set currently held with the organization's. with the public m private partnership with the
1:22 am
allocation and of resources across the country through better zero bin data exchange. over the past couple of years our executive branch is focused on an open data in digital and transformation. we have the policy today this is a change for us. where we hold onto our information but that nobody has the intended outcome with the implementation with value creation and. to provide us a great perspective and is happening in the executive branch to.
1:23 am
>> a senior represser floor office of setup -- science and technology responsible for deciding and implementing u.s. policy on issues coordinating domestic efforts end with the federal agencies so pryor's this the attorney adviser to serve the freedom of the reformation act responsible for the agencies to resolve and agency compliance. the deputy chief customer officer at the general services administration in helping the agency to read
1:24 am
data driven decisions of technology and says a senior advisor to the gsa administrator from performance manage risk strategies in before that a financial manager in analyst in the office of travel murder vehicle and park service's. finally of a fiscal assistant secretary of u.s. treasury to develop policies to oversee the operation and the first commissioner establish with the consolidation of the bureau of public debt.
1:25 am
also serving as acting director for the day to day operations. we have a panel that has great perspectives among is happening across the government to make them available for public use to of lot value for society. please join me to welcome them. [applause] spirit really hear from the white house office of -- science and technology. >> good morning. there was an amazing lineup today.
1:26 am
with the day back but what is the big picture? >> is specifically referring to the open government memo this collaborative of the work that they do and of course, the freedom of permission and act but to capture one to push the f.
1:27 am
byrd for word to to run december 2009 what the administration mingy be more open in and collaborative. part of that directive to be given at identifying those that are valuable reno wedded data sets is more than 160,000 open in government data sets end it
1:28 am
also involves the efforts but a round the globe opening the government partnership but to let them more accountable governments. we also have a slew of open data. powless since 2011 focusing space the data efforts to regain to mix the and across all sectors with more then those three value datasets
1:29 am
would get that great worked with the new default information n to in those environments to have that ( data policy that followed it to to manage a affirmation of data access. this is founded 2011 and by president obama the great thing of the open in partnership between the countries but a dozen civil
1:30 am
society organizations know at 66 countries together it has created thousands of commitments around the world to be open-ended end transparent so this is a platform to and finance -- today and city way including zero bin data. for every two years very specifically commitments that they will take with civil society and it is across all sectors related
1:31 am
to its transparency of her cultural data. and that last plan that was published just tunnel of a little too soon had a net date specifically that is incredibly helpful to allow us to track the work we're doing a loving as to read variants on into the world stage to share that with the other countries but does have a strong focus saw an open data that they have
1:32 am
included funds from all over the world but the zero gpa has a big segment each year to deliver in the national action plan to do to be presented that is coming at the end of october so we have a giant delegation is planning to go if you're interested -- if you're interested you are invited.
1:33 am
making >> there is a home page of 160,000 datasets across all sectors. so another effort publishing
1:34 am
virginia open data action plan and then to publish an open data planned for won't create have a document into managing the data. >> a few examples of how the united states is striving to open data and how we bring that back to our partnership. ottman topic but it may not be on everyone's radar.
1:35 am
that comes from the u.s. land of gas and oil values and timber. in what is pitifully created but the interior look at the transparency initiative but they did not tell the criteria they're working to get us there now we're getting there. but the key criteria through that open data work to loom
1:36 am
visualize that data so it is in this line of work and other great example is where the united states with lots of money and people want to track where those funds go in the friends are hopeful to find the data or cleave the data to be sure it says usable to be visualized
1:37 am
their the florida system inside. to open data to make it easy for just about anyone. next epo the agenda to make better data available to be in a friendly way is through usa's spending and my colleagues will talk about that so i will talk about from the process perspective but that the u.s. say spending team at treasury at my colleagues have been amazing it is an open process something we've opted north carolina in policy making.
1:38 am
it is fantastic as say build improvements every step have sought the back from mentor agency in a civil society ma the key holders we would be remiss to leave out and it has ben useful has an example even in other sectors juicy how successful this process has been so i will stop there and we will listen to my colleagues at the table. [applause] >> i am with said gsa chief koesterer officer and i am here to speak how the gsa is helping companies get
1:39 am
started cry has been there 90 years and it always surprises me we're this of known organization that operates behind the scenes but we have a sexy mission ever reach writings first for government that helped other agencies get started their candied these executive orders that come out of the catch that and then test them that help other agencies get started. there has been focused iraq of opening day dash and citizen experience and gsa is a great place rose to get started. summit to pick up this specific work then hold the promise for the future.
1:40 am
so with all bin data that platform has been around but they help consumers make better decisions and says they release all consumer complaints of of the baby every single comet comes out even airline's on-time but with citizens it is a great platform that we manage to
1:41 am
clean a the datasets. also add a api technology. from that citizen experienced event we have been around for about of your with the interaction use of government a small team but we're pushing hard in a few areas.
1:42 am
in to reheard there are pretty good information from the experiences with it didn't have data. it is a physical kiosk to go through the security experience and then give realtime data on your experience. but they see that it is
1:43 am
crazy son just like the dmv to expect higher than normal traffic levels. into improve this is is is. and then to open it up to the public also looking at standardize transactional level if you complete a college chaplet cash when dash application form that is exciting for us. also launched for government is yelp also all terms of
1:44 am
services is to allow them to interact with the public. we just initiated yelp because there is a lot of pages already established for agencies but they had no way to claim that as there's. so now they have the ability to do that. been to those who give feedback on the building to use the data in realtime. is a somebody leaves accommodated responded realtime so that is an exciting pilot but they can
1:45 am
use that data in realtime. bin and looked at the digital analytics program it is like a google software to install on every web site to see the traffic to their site the browsers how long they stay edward they come from so something to your talk about but the program
1:46 am
to start off that gsa has been around a little over a year in their mission in is to help agencies design software. >> so we can negotiate terms that developers need and encourage agencies to have their own digital service teams so they are popping up
1:47 am
and that is exciting. one of the projects such as rolled out was the score card that was a great example to help bin agency unleashed data that with treasury and department of education and could come through 18 years of 8,000 different schools is and universities searching in that data in a meaningful way so people applying could make better decisions. a percentage of those do graduate, is starting salary comic it is exciting way to take these to work across the whitehouse to unleash the data in meaningful ways. the is of a couple of examples.
1:48 am
thank you. [applause] >> and i was say that not everyone was wildly enthusiastic of the passage bet it was an effort to move the federal government to increase transparency. but we have been leading the
1:49 am
effort and enormously impressed with how much progress we have made in the last year and i will talk about those but some of the key thing is is to convince the community there is a number of cases why the federal government needs to be more open and transparent and the federal spending area. previously there questions have to get access? that is a challenge that it is silo did so many different systems we could sit next to each other but you may have access to a system that i don't and we are in the same organization it would be better if we had access to the save information to make decisions. every talk about that with better decisions and better government.
1:50 am
the fed is the reason why the federal government needs to move forward how we're spending the taxpayers' dollars but they have a right to to know what we are doing going back to thomas jefferson talking about the average person should understand the finances of the united states government and where the spending is going the ig community has the same interest in those are important but they may not motivate themselves and that is why it is so important to talk to the federal community why it is important to be moving forward. so we will talk about accomplishments issuing the data standards this is a theme throughout by a discussion in the past
1:51 am
looking at a big initiative the want to build things and the big systems and spend a lot of time with a waterfall process along no way. we'll look at the data centric effort it exist already so it isn't to change that data but to of lockett and capture it so we've the get the various systems to find where it exists to extract it is presented in the way that they can use it is a different approach to a much less expensive. so if you have this process to do things quickly you
1:52 am
fail fast in read you learn quickly how things are going. we have a number of pilots under way and we're very pleased with the initial results to take the concept to execute to show that you can extract data from the systems where they reside and presented a meaningful way and those are eager to help us with that presentation of process end to -- and people will figure out ways to make them more usable but we see that with tools that are very inexpensive and that is encouraging. some data us centric we want to make sure that they owned
1:53 am
their data and the quality is good and then they can use the data one thing we've learned was it turns out agencies are using usa spending to get access to their own data they cannot get access to their own data because it is an configured in a way to have access to the ozone systems so that is taking the data to make it more accessible and decentralized way. treasury has put together a program one of the best practices you could do with a small group of people they said let's take a look at the 57 standards to see rather that data resides in
1:54 am
bin gave it to to every agency and said the figure out how you line up relative to the prototype and it seems they line up pretty well but if there resided to three systems which is the best for extraction in? now we're moving much further along in into say the best place is here the best place to extract this is the best source of truth. also gave put together the implementation plan the question can approve air how do they know what they're doing? put together a small team in figure out where the standards reside in preview
1:55 am
those and think what your broker will of the classic to be presented to the public at large now treasury is responsible for what does that presentation looks like? i would point out to to my colleagues with we hear today that both organizations have had a disproportionate impact relative to size with what they're doing compared to government to talk about user involvement really is fundamental to our ability to restorer trust in the federal government to access its formation easily and understand what the government is trying to do is incredibly important for i am not sure the public sees the work they're doing but i assure you that over
1:56 am
the next five for 10 years the impact will be enormous. is already impacting my organization how we looked at the surface is that we provide. so my generation shows how i have to adapt to a technology and i have to figure out how to use said. that is not the way the world is working out. now adapts to the user the only way you can do that is if you're out talking to people to say let's get this and tell us how to do this better. that is important that both have been at the heart of that to drive that thinking and beverages big it it takes time but as to look for red you will see the impact across government in
1:57 am
a positive way. the delays give me slides and i never follow them. but i will get to the invitation. this talks about the old way into a new way of reaching out to. what treasury will put for word shortly, we are interested what their next generation of usa's spending with the klay key and reoffering and invitation this doesn't have to be in place and tell may 2017 but that is not too soon to start.
1:58 am
so we are saying is to put this site together we want to know what the users want so we have some of mechanisms how government spends money can purchase the paper co first of all, this is the of indentation at the french but it is an example of the ability for the community to provide input to read what people to be here on a regular basis this is another way for people to say here is the different functionality but the next slide is the example that we have taken from usa's spending today that we want you to look at
1:59 am
the tools is it useful? the next five-year issue was since of the power as an example of looking of various lovell's you can see green comes to the department of transportation and money spent on highway planning and as we move down there is more of levels of a regularity and that is an aspect to drill down into a vertical way but horizontal a to see how spending is occurring we still have work to do on the data standards. klay said performance is of
2:00 am
a difficult issue of program means different things so they need to continue to work on those things but i do feel good of the of people that we haven't solved about that lovell of engagement and talking about that implementation plan to put the small teams together we have a couple of pilots showing great promise. . .

53 Views

info Stream Only

Uploaded by TV Archive on