We are delighted to announce that a new course designed to assist those with a non-technical background to perform TDM practices is now available online.
Introduction to Text and Data Mining contains six sections, and takes around six hours to complete:
Introduction to text mining
Do you speak TDM? Understanding the key concepts and areas of TDM
Text and data mining and licensing
What research support staff can do for text and data mining
Time to text mine: practical activities
Glossaries on text and data mining
This course was created as part of the EU-funded 'Opening Infrastructure for Text and Data Mining - OpenMinTeD' project, in collaboration with the Office of Scholarly Communication, Cambridge University Library.
Mike Jackson and Neil Chue Hong, Director at the Software Sustainability Institute, have been working to develop guidance for software deposit and preservation for different stakeholders, as part of the Jisc-funded 'Research Data Shared Service' pilot programme.
They have released draft versions of this guidance for feedback, and invite the comments of the RDA software community. Please note when following the link: the README.txt provides an overview of the material, along with the guiding principles. The CONTRIBUTE.txt provides instructions for commenting, which is being done via Google Docs for this first stage.
They are also running a software deposit and preservation workshop to discuss this guidance and to identify ways in which to drive forward the adoption of best practices in software deposit and preservation (details below).
Last month we highlighted the launch of Data Tree, the free, NERC-funded online course that covers all you need to know for research data management, along with ways to engage and share data with business, policymakers, media and the wider public.
You can now access three modules of the Data Tree research data management training course online:
Data Management: Context - a broad overview of key topics relating to data management, including some core terminology and an introduction to the research data lifecycle, reproducibility and data ethics.
Data Management: NERC - data management in the context of being a NERC-funded researcher. This module examines the policies that NERC adheres to, and introduces the five environmental data centres.
Data & Research: Working with the Media and Public - guidance on how to best present your research to non-scientific audiences, and how to build good communication networks to engage with the media in different formats and forums.
The full course will be released in mid-July, at which point completion certificates will be added to each module. A full course certificate will also be awarded to people who complete every module. Importantly, any work you do now will still contribute towards your certificates, so you can start exploring the course right away!
Access the course and learn more about the funders and supporters who are involved in Data Tree on the website.
You can now catch the second episode of the Open Science Community Utrecht's podcast series, The Road to Open Science. The episode, entitled ‘Collaboration is Key’, continues to follow the path to open science through the perspective of researchers from different disciplines, talking to people within the academic community about their research, initiatives or experiences in relation to open science.
Episode two features conversations with Daniel Lakens (Eindhoven University of Technology), Anita Eerland and Loek Brinkman (Open Science Community Utrecht), and Cambridge's own 'open champion', Kirstie Whitaker (Alan Turing Institute, UK), who discuss the most important role the university can play to foster open science.
Annemarie Hildegard Eckes, one of our Data Champions, has contributed to the Data Tree blog, discussing how the new online training course offers a more flexible approach to learning essential data management practice.
Her survey of fellow PhD students revealed that many at the end of their PhD felt lack of data training had slowed down their progress. Although they would advocate training at the beginning of the project, several felt their schedules and working habits prevented them from attending talks in person. Annemarie hopes that the Datatree platform, which can be accessed anywhere at any time, will remove some of the barriers to learning more about RDM.
Annemarie uses this figure in her RDM training talks, and believes 'training in RDM for us PhD students is useful for two reasons, firstly to learn the right habits and secondly to enhance productivity throughout the duration of the PhD.'
Smileys figure, Annemarie Hildegard Ecke 2018, used with permission
Workshops have been taking place around the country to accompany the launch of the Data Tree free online data management course. Each workshop has explored a key theme of the course in more detail.
There are still places available at the final two workshops of the series; these are open to all, and PhD students and early career researchers are particularly encouraged to come along.
Engaging with public policymakers University of Bristol
The Software Sustainability Institute has produced a report and a collection of guides on software deposit and preservation for researchers, research leaders and research data managers.
At this workshop they will bring together research data managers, digital repository vendors, publishers, policymakers and researchers. They will present their guidance, discuss software deposit and preservation from the perspectives of the foregoing stakeholders, and identify ways in which to drive forward the adoption of best practices in software deposit and preservation.
There is still time to submit an application to attend OpenCon 2018. Applications submitted early will be reviewed first, and the Organizing Committee may issue early acceptance decisions, especially to applicants who need visas.
Even if you are not planning to apply for this year’s OpenCon, you are still encouraged to visit the application form to provide an update on your work. Updates from alumni provide the community with invaluable information on what OpenCon alumni are accomplishing and the overall impact of the conference.
Have you ever wondered how you can start a company from freely available public resources?
EPOC (Entrepreneurial Postdocs of Cambridge) has teamed up with Repositive, One Nucleus, and ELIXIR UK to explore how public bioinformatics infrastructures support innovation in the life sciences and generate revenue.
The evening will include talks by ELIXIR UK describing the entrepreneurial environment of public data resources and the types of business models that can be leveraged to start-up a company. The Cambridge based company Repositive will act as a real-life example of a company operating in this innovation space and show how a collaborative and open approach can help to tackle the most challenging scientific questions. One Nucleus will conclude the evening with a talk on how industry associations can support academics, start-ups, SMEs and large multinational companies in their pursuit. The talks will be followed by a Q&A session in panel style chaired by Gabriella Rustici (Head for the Bioinformatics Training Programme, School of Biological Sciences, University of Cambridge & ELIXIR UK Training Coordinator).
A great opportunity to look at the latest technical updates and network with the best in UK academia and industry in the field of research software engineering. Stay on and help hack software citation with the #CitationFileFormat during a hack day on 5 September.
The 'Berkeley Initiative for Transparency in the Social Sciences' is accepting applications for self-funded participants. This 3-day workshop is designed for social science and public health science graduate students, postdocs, faculty and research professionals who are eager to learn more about cutting-edge mechanisms for transparent and reproducible research. The curriculum will cover the most relevant topics in research transparency, including pre-registration and pre-analysis plans; data preparation and de-identification; version control and dynamic documents; and other open science software.
The Office of Scholarly Communication sends this Newsletter to its subscribers in order to disseminate information relevant to research data management topics. For further information on our privacy policy please visit the University Information Compliance website.
When you book an event run by the Office of Scholarly Communication (OSC), you will automatically be added to our mailing lists to receive both the 'Research Data Management' and the OSC 'Kaleidoscope' monthly newsletters.
To unsubscribe or update your preferences, please visit the links at the bottom of this email.