Kaltenmeyer (talk | contribs) m clean up, typo(s) fixed: european → European (2), long distance → long-distance, long term → long-term, publicly- → publicly , privately- → privately (5), softwares → software, infratructures → infrastructures (3), 2007-2
m Typo Tags: Visual edit Mobile edit Mobile web edit
(9 intermediate revisions by 6 users not shown)
Line 3:
[[File:Open science pillars.png|thumb|upright=1.35|Open Science infrastructure is one of the four pillars of Open Science in the UNESCO Recommendation on Open Science (2021).]]
'''Open Science Infrastructure''' (or ''open scholarly infrastructure'') is
Open science infrastructures are a form of scientific infrastructure (also called ''[[cyberinfrastructure]]'', ''[[e-Science]]'' or ''e-infrastructure'') that support the production of open knowledge. Beyond the management of common resources, they are frequently structured as community-led initiatives with a set of collective norms and governance regulations, which also makes them a form of [[knowledge commons]]. The definition of open science infrastructures usually excludes privately owned scientific infrastructures run by leading commercial publishers. Conversely, it may include actors not always characterized as scientific infrastructures that play a critical role in the ecosystem of open science, such as publishing platforms in open access (''open scholarly communication service'').
Line 14:
''Open science infrastructure'' is a form of knowledge infrastructure that makes it possible to create, publish and maintain open scientific outputs such as publications, data or software.
===Infrastructure===
The use of the term "infrastructure" is an explicit reference to the physical infrastructures and networks such as power grids, road networks or telecommunications that made it possible to run complex economic and social systems after the industrial revolution: "The term infrastructure has been used since the 1920s to refer collectively to the roads, power grids, telephone systems, bridges, rail lines, and similar public works that are required for an industrial economy to function (
Open science infrastructures have specific properties that distinguish them from other forms of open science projects or initiatives:
Line 39:
The ''Principles'' attempt to hybridize the framework of infrastructure studies with the analysis of the [[commons]] initiated by [[Elinor Ostrom]]. The principles develop a series of recommendations in three areas critical to the success of open infrastructures:
* '''Governance''': the governance of the infrastructure should be open and accountable to the scientific communities it aims to serve. Specific measures should ensure that the management of the organization is transparent and diverse.{{sfn|Bilder|Lin|Neylon|2015}}
* '''Sustainability''': the core activities of the organization should be covered by recurring funds. Short-term subventions should be limited to short-term projects.
* '''Insurance''': the technical infrastructure and the output of the organization are open. This ensures that the infrastructure can be recreated if necessary (in the jargon of open source, it becomes "forkable").{{sfn|Bilder|Lin|Neylon|2015}}
Line 55:
Scientific projects have been among the earliest use cases for digital infrastructure. The theorization of scientific knowledge infrastructure even predates the development of computing technologies. The knowledge networks envisioned by [[Paul Otlet]] or [[Vannevar Bush]] already incorporated numerous features of online scientific infrastructures.{{sfn|Borgman|2007|p=40}}
After the Second World War, the United States faced a "periodical crisis": existing journals could not keep up with the rapidly increasing scientific output.{{sfn|Wouters|1999|p=61}} The issue became politically relevant after the successful launch of [[Sputnik]]: "The Sputnik crisis turned the librarians’ problem of bibliographic control into a national information crisis."{{sfn|Wouters|1999|p=62}} The emerging computing technologies were immediately considered as a potential solution to make a larger amount of scientific output readable and searchable. Access to foreign-language publications was also a key issue that was expected to be solved by [[machine translation]]: in the 1950s, a significant number of scientific publications [[Languages of Science|were not available in English]], especially those coming from the Soviet
Influential members of the [[National Science Foundation]] like [[Joshua Lederberg]] advocated for the creation of a "centralized information system", [[SCITEL]], that would at first coexist with printed journals and gradually replace them altogether on account of its efficiency.{{sfn|Wouters|1999|p=60}} In the plan laid out by Lederberg to Eugene Garfield in November 1961, the deposit would index as many as 1,000,000 scientific articles per year. Beyond full-text searching, the infrastructure would also ensure the indexing of citations and other metadata, as well as the automated translation of foreign-language articles.{{sfn|Wouters|1999|p=64}}
Line 88:
Several competing terms appeared to fill this need. In the United States, the term ''cyber-infrastructure'' was used in a scientific context by a US National Science Foundation (NSF) blue-ribbon committee in 2003: "The newer term cyberinfrastructure refers to infrastructure based upon distributed computer, information and communication technology. If infrastructure is required for an industrial economy, then we could say that cyberinfrastructure is required for a knowledge economy."{{sfn|Atkins|2003|p=5}} E-infrastructure or e-science were used with a similar meaning in the United Kingdom and European countries.
Thanks to "sizable investments",{{sfn|Eccles et al.|2009}} major national and international infrastructures were launched between the initial policy discussions of the early 2000s and the economic crisis of 2007–2008, such as the [[Open Science Grid]], [[BioGRID]], the [[Jisc|JISC]], {{ill|DARIAH|
By 2010, infrastructures were "no longer in infancy" and yet "they are also not yet fully mature".{{sfn|Eccles et al.|2009}} While the development of the web solved a large range of technical issues regarding network management, building scientific infrastructure remained challenging. Governance, communication across all involved stakeholders, and strategic divergences were major factors of success or failure. One of the first major infrastructures for the humanities and the social sciences, [[Project Bamboo]], was ultimately unable to achieve its ambitious aims: "From the early planning workshops to the [[Mellon Foundation]]
[[File:Providers of digital tools for the scientific workflow.png|thumb|Leading commercial ecosystems for scientific research]]
Leading commercial publishers were initially left behind by the unexpected rise of the Web for academic publication: the executive board of [[Elsevier]] "had failed to grasp the significance of electronic publishing altogether, and therefore the deadly danger that it posed—the danger, namely, that scientists would be able to manage without the journal".{{sfn|Andriesse|2008|pp=257-258}} The persistence of high revenues from subscription and the consolidation of the sector made it possible to fund the conversion of the pre-existing online services to the web as well as the digitization of past collections. By the 2010s, leading publishers were "moving from a content-provision to a data analytics business"<ref name="andressi_5">{{harvnb|Aspesi et al.|2019|p=5}}</ref> and had developed or acquired new key infrastructures for the management of scientific and pedagogic activities: "Elsevier has acquired and launched products that extend its influence and its ownership of the infrastructure to all stages of the academic knowledge production process".{{sfn|Posada|Chen|2018|p=6}} As it has expanded beyond publishing, the ''vertical integration'' of privately owned infrastructures has become deeply embedded in daily research activities.
{{blockquote|The privatised control of scholarly infrastructures is especially noticeable in the context of ‘vertical integration’ that publishers such as Elsevier and SpringerNature are seeking by controlling all aspects of the research life cycle, from submission to publication and beyond. For example, this vertical integration is represented in a number of
=== Toward open science infrastructures (2015-…) ===
Line 108:
Since 2015, these principles have become the most influential definition of Open Science Infrastructures. They have been endorsed by leading infrastructures such as Crossref,{{sfn|Bilder|2020}} OpenCitations{{sfn|Di Giambattista|2021}} or Data Dryad{{sfn|The Dryad Team|2020}} and have become a common basis for the institutional evaluation of existing open infrastructures.{{sfn|Ficarra et al.|2020|p=21}} The main focus of the ''Principles'' is to build "trustworthy institutions" with significant commitments in terms of governance, financial sustainability and technical efficiency so that they can be durably relied on by scientific communities.{{sfn|Neylon|2017|p=7}}
By 2021, public services and infrastructures for research have largely endorsed open science as an integral part of their activity and identity: "open science is the dominant discourse to which new online services for research refer."{{sfn|Fecher et al.|2021|p=505}} According to the 2021 Roadmap of the {{ill|European Strategy Forum on Research Infrastructures|
In agreement with the original intent of the ''Principles'', open science infrastructures are "seen as an antidote to the increased market concentration observed in the scholarly communication space."{{sfn|Kraker|2021|p=2}} In November 2021, the UNESCO Recommendation on Open Science acknowledged open science infrastructure as one of the four pillars of open science, along with open scientific knowledge, open engagement of societal actors and open dialogue with other knowledge systems, and called for sustained investment and funding: "open science infrastructures are often the result of community-building efforts, which are crucial for their longterm sustainability and therefore should be not-for-profit and guarantee permanent and unrestricted access to all public to the largest extent possible."{{sfn|UNESCO|2021}}
Line 122:
Open Access repositories are the most frequent form of Open Science Infrastructure,<ref>{{harvnb|Operas Landscape Study|2017|p=15}}</ref> with 5,791 repositories in existence in December 2021 according to OpenDOAR.{{sfn|OpenDOAR Statistics}}
Yet, there is a significant diversification of the roles and the activities of open science infrastructures, at least among the largest ones. In the survey of European infrastructures conducted by SPARC Europe, 95% of the respondents mention that they provide services in at least three different stages of research production out of six (Creation, Evaluation, Publishing, Hosting, Discovering and Archiving).{{sfn|Ficarra et al.|2020|p=13}}
Specialization does happen at a higher level. A network analysis identifies "two main clusters of activities":
Line 154:
=== Economics ===
Many Open Science Infrastructures run "at a relatively low cost", as small infrastructures are an important part of the open science ecosystem.{{sfn|Ficarra et al.|2020|p=35}} In 2020, 21 out of 53 surveyed European infrastructures "report spending less than €50,000".{{sfn|Ficarra et al.|2020|p=35}} Consequently, more than 75% of surveyed European infrastructures are run by small teams of 5 FTEs or fewer.<ref>{{harvnb|Ficarra et al.|2020|p=41}}</ref> The size of an infrastructure and the extent of its funding are not always proportional to the importance of the service it offers: "some of the most heavily used services make ends meet with a tiny core team of two to five people."<ref>{{harvnb|Kraker|2021|p=3}}</ref> Volunteer contributions are significant as well, which is both "a strength and weakness to an
Overall, European infrastructures were financially sustainable in 2020,<ref>{{harvnb|Ficarra et al.|2020|p=51}}</ref> which contrasts with the situation ten years prior: in 2010, European infrastructures had much less visibility, usually lacked "a long-term perspective" and struggled "with securing the funding for more than 5 years".{{sfn|eResearch2020|2010|p=103}} In 2020, European infrastructures frequently relied on grants from national funds and from the European Commission.{{sfn|Ficarra et al.|2020|p=45}} Without these grants, most of these actors "could only remain viable for less than a year".{{sfn|Ficarra et al.|2020|p=48}} Yet, one quarter of surveyed European infrastructures were not supported by any grants or subventions and relied on either alternative sources of income or voluntary contributions.{{sfn|Ficarra et al.|2020|p=35}} As they can be "difficult to define adequately", open science infrastructures can be overlooked by funding bodies, which "contributes to the challenge of securing funding".<ref>{{harvnb|Neylon|2017|p=1}}</ref>
Line 165:
=== Definitions ===
* {{Cite
** {{Cite web |vauthors=Bilder G, Lin J, Neylon C |title=The Principles of Open Scholarly Infrastructure| date=2020| doi=10.24343/C34W2H| doi-access=free |access-date=2021-11-01| url=https://openscholarlyinfrastructure.org/| via=The Principles of Open Scholarly Infrastructure}}
* {{Cite web |ref={{harvid|SPARC|2020}}| last1= SPARC| last2= COAR| date=2019| title = Good Practice Principles for Scholarly Communication Services| work = SPARC| accessdate = 2021-12-12| url = https://sparcopen.org/our-work/good-practice-principles-for-scholarly-communication-services/}}
Line 177:
* {{Cite report |last=Lewis| first=David W.| title=Mapping Scholarly Communication Infrastructure: A Bibliographic Scan of Digital Scholarly Communication Infrastructure| date=May 2020| location=Atlanta, GA| publisher=Educopia Institute| url=https://scholarworks.iupui.edu/server/api/core/bitstreams/cee09afc-db34-42f5-840b-be44338ed691/content| access-date=2021-12-12}}
*{{Cite report| author=((eResearch2020))| publisher = European Commission| title = The role of e-Infrastructures in the creation of global virtual research communities| location = Brussels| date = 2010|url = https://op.europa.eu/en/publication-detail/-/publication/edf0fed4-c01a-454b-8a9e-34f602b00100}}
* {{Cite report |ref={{harvid|Operas Landscape Study|2017}}| publisher = OPERAS| title = Landscape Study on Open Access Publishing| series = Design for Open Access Publications in European Research Areas for Social Sciences and Humanities| date = 2017| doi=10.3030/731031 |url=https://cordis.europa.eu/project/id/731031/results| url-access = subscription}}
* {{Cite report| last1= Chodacki| first1= John| last2= Cruse| first2= Patricia| last3= Lin| first3= Jennifer| last4= Neylon| first4= Cameron| last5= Pattinson| first5= Damian| last6= Strasser| first6= Carly| title = Supporting Research Communications: a guide| accessdate = 2021-12-11| date = 2018-04-05| url = https://zenodo.org/record/3524663}}
*{{Cite report |ref={{harvid|Aspesi et al.|2019}}| publisher = LIS Scholarship Archive| last1= Aspesi| first1= Claudio| last2= Allen| first2= Nicole Starr| last3= Crow| first3= Raym| last4= Daugherty| first4= Shawn| last5= Joseph| first5= Heather| last6= McArthur| first6= Joseph| last7= Shockey| first7= Nick| title = SPARC Landscape Analysis: The Changing Academic Publishing Industry – Implications for Academic Institutions| accessdate = 2022-01-05| date = 2019-04-03| url = https://osf.io/preprints/lissa/58yhb/}}
Line 198:
* {{Cite book | publisher = Brill| isbn = 978-90-04-17084-1| last = Andriesse| first = Cornelis D.| title = Dutch Messengers: A History of Science Publishing, 1930–1980| location = Leiden; Boston| date = 2008-09-15}}
*{{Cite book | publisher = OUP Oxford| isbn = 978-0-19-956113-1| last1= Bygrave| first1= Lee A.| last2= Bing| first2= Jon| title = Internet Governance: Infrastructure and Institutions| date = 2009-01-22}}
* {{Cite book | publisher = Peter Lang| pages = 29–41| editor = Frédéric Clavert |editor2=Serge Noiret| last = Dacos| first = Marin| title = L'histoire contemporaine à l'ère numérique| chapter = Cyberclio : vers une cyberinfrastructure au cœur de la discipline historique| location = Bruxelles; Bern; Berlin; Frankfurt am Main; New York; Oxford; Wien| date = 2013| language=fr| trans-title=Contemporary History in the Digital Age| trans-chapter=Cyberclio. Towards a Cyberinfrastructure at the heart of the historical discipline| url=https://www.academia.edu/4558796
*{{Cite book | publisher = IOS Press| isbn = 978-1-61499-383-4| last = Hogan| first = A.| title = Reasoning Techniques for the Web of Data| date = 2014-04-09}}
*{{Cite book | publisher = Rowman & Littlefield| isbn = 978-0-8108-9088-6| last = Regazzi| first = John J.| title = Scholarly Communications: A History from Content as King to Content as Kingmaker| date = 2015-02-12}}
Line 215:
*{{Cite journal | doi = 10.1057/jit.2013.4| issn = 0268-3962| volume = 28| issue = 1| pages = 18–33| last1= Campbell-Kelly| first1= Martin| last2= Garcia-Swartz| first2= Daniel D| title = The History of the Internet: The Missing Narratives| journal = Journal of Information Technology| date = 2013| s2cid = 41013}}
* {{Cite journal |last=Dombrowski| first=Quinn| title=What Ever Happened to Project Bamboo?| journal=Literary and Linguistic Computing| volume=29| issue=3| access-date=2021-12-22| date=2014-06-16| pages=326–339| doi=10.1093/llc/fqu026| url=https://escholarship.org/uc/item/6jq660tm}}
* {{Cite journal| last=Cassella| first=Maria| title=Piattaforme digitali per la pubblicazione di contenuti di ricerca: esperienze, modelli open access, tendenze| journal=Biblioteche
* {{Cite journal |ref={{harvid|Karasti et al. I|2016}}| doi = 10.23987/sts.55406| issn = 2243-4690| volume = 29| issue = 1| pages = 2–12| last1= Karasti| first1= Helena| last2= Millerand| first2= Florence| last3= Hine| first3= Christine M.| last4= Bowker| first4= Geoffrey C.| title = Knowledge Infrastructures: Part I| journal = Science & Technology Studies| date=2016-02-12| doi-access = free}}
* {{Cite journal |ref={{harvid|Karasti et al. IV|2016}}| doi = 10.23987/sts.60220| issn = 2243-4690| volume = 29| issue = 4| pages = 2–9| last1= Karasti| first1= Helena| last2= Millerand| first2= Florence| last3= Hine| first3= Christine M.| last4= Bowker| first4= Geoffrey C.| title = Knowledge Infrastructures: Part IV| journal = Science & Technology Studies| date=2016-12-14| doi-access = free}}
Line 231:
* {{Cite journal| doi = 10.1093/joc/jqz052| issn = 0021-9916| volume = 71| issue = 1| pages = 1–26| last1= Dienlin| first1= Tobias| last2= Johannes| first2= Niklas| last3= Bowman| first3= Nicholas David| last4= Masur| first4= Philipp K.| last5= Engesser| first5= Sven| last6= Kümpel| first6= Anna Sophie| last7= Lukito| first7= Josephine| last8= Bier| first8= Lindsey M| last9= Zhang| first9= Renwen| last10= Johnson| first10= Benjamin K.| last11= Huskey| first11= Richard| last12= Schneider| first12= Frank M.| last13= Breuer| first13= Johannes| last14= Parry| first14= Douglas A.| last15= Vermeulen| first15= Ivar| last16= Fisher| first16= Jacob T.| last17= Banks| first17= Jaime| last18= Weber| first18= René| last19= Ellis| first19= David A| last20= Smits| first20= Tim| last21= Ivory| first21= James D| last22= Trepte| first22= Sabine| last23= McEwan| first23= Bree| last24= Rinke| first24= Eike Mark| last25= Neubaum| first25= German| last26= Winter| first26= Stephan| last27= Carpenter| first27= Christopher J.| last28= Krämer| first28= Nicole| last29= Utz| first29= Sonja| last30= Unkel| first30= Julian| last31= Wang| first31= Xiaohui| last32= Davidson| first32= Brittany I.| last33= Kim| first33= Nuri| last34= Won| first34= Andrea Stevenson| last35= Domahidi| first35= Emese| last36= Lewis| first36= Neil A.| last37= de Vreese| first37= Claes| title = An Agenda for Open Science in Communication| journal = Journal of Communication| date = February 2021| hdl = 10919/99938| hdl-access = free}}
* {{Cite journal| doi=10.7771/2380-176X.8409| issn=2380-176X| volume=31| issue=5| last=Vandegrift| first=Micah| title=The Golden Age of the Green Ecosystem: A Color-Blind Perspective on Repositories| journal=Against the Grain| date=2021-03-01| s2cid=233797804| doi-access=free}}
*{{Cite journal |doi=10.5860/crln.82.6.265| last=Boston| first=A. J.| title=Thinking politically about scholarly infrastructure: Commit the publishers to 2.5% |journal=College & Research Libraries News| date=2021-06-04| volume=82| issue=6| page=265| doi-access=free}}
*{{Cite journal| ref={{harvid|Fecher et al.|2021}}| doi=10.1093/scipol/scab026| issn=0302-3427| volume=48| issue=4| pages=499–507| last1=Fecher| first1=Benedikt| last2=Kahn| first2=Rebecca| last3=Sokolovska| first3=Nataliia| last4=Völker| first4=Teresa| last5=Nebe| first5=Philip| title=Making a Research Infrastructure: Conditions and Strategies to Transform a Service into an Infrastructure| journal=Science and Public Policy| date=2021-08-01| doi-access=free}}
*{{Cite journal |doi=10.21428/6ffd8432.a1d2856b| doi-access=free| volume=1| issue=1| last=Kraker| first=Peter| title=Now is the Time to Fund Open Infrastructures| journal=Commonplace| date=2021-08-16}}
Line 249:
* {{Cite web| title = The end of the journal? What has changed, what stayed the same?| last=Neylon| first=Cameron| date=2015-11-29| work=Science in the Open| accessdate = 2021-10-31| url = http://cameronneylon.net/blog/the-end-of-the-journal-what-has-changed-what-stayed-the-same/}}
* {{Cite web| last = Guédon| first = Jean-Claude| title = Open Access: Toward the Internet of the Mind| work = BOAI| url=https://www.budapestopenaccessinitiative.org/boai15/open-access-toward-the-internet-of-the-mind/| access-date=2021-12-12}}
* {{cite
* {{cite web |author=The Dryad Team |date=2020-12-08 |url=https://blog.datadryad.org/2020/12/08/dryads-commitment-to-the-principles-of-open-scholarly-infrastructure/ |title=
* {{cite web |title=Open Science MOOC Response to UNESCO Draft Open Science Recommendations |author=((Open Science MOOC 2020 Steering Committee)) |date=December 30, 2020 |url=https://en.unesco.org/sites/default/files/comments_osr_partner_open_science_mooc_document.pdf}}
* {{cite web |last=Di Giambattista |first=Chiara |date=2021-08-09 |url=https://opencitations.wordpress.com/2021/08/09/opencitations-compliance-with-the-principles-of-open-scholarly-infrastructure/ |title=
{{refend}}