Please use the following text to cite this item or export to a predefined format:
Centre for Language Technology, NorS, University of Copenhagen and The Danish Language Council, 2011, DK-CLARIN LSP Corpus - Nanotechnology domain, CLARIN-DK-UCPH Centre Repository, http://hdl.handle.net/20.500.12115/16.
dc.creatorOlsen, Sussi
dc.creatorBraasch, Anna
dc.creatorJakob, Halskov
dc.creatorHansen, Dorte Haltrup
dc.date.accessioned2018-06-08T11:44:03Z
dc.date.available2018-06-08T11:44:03Z
dc.date.issued2011
dc.descriptionTexts in the Nanotechnology domain come from iNano (Interdisciplinary Nanoscience Center, AU), Nano (DTU), Niels Bohr Institutet, Forskningscenter Risø, Ministeriet for Sundhed og Forebyggelse (via DTU), Miljøstyrelsen, Aktuel Naturvidenskab and have been collected in the DK-CLARIN project, WP2.2, 2008 - 2011. The corpus consists of 358,144 words in 157 files. Communicative setting/Number of files: expert->advanced (13) expert->basic (144) All texts are in XML TEIP5 format (TEIP5DKCLARIN-format), with tokenisation, sentence and paragrapgsegmentation, pos-tagging, lemmatisation and termhood annotation placed in separate text external spangroups. "DK-CLARIN LSP Corpus - Nanotechnology domain" is a part of the Danish DK-CLARIN LSP corpus consisting of seven sub-corpora from following subject domains: Agriculture, Construction, Economics, Environment, Health, IT and Nanotechnology.
dc.identifier.urihttp://hdl.handle.net/20.500.12115/16
dc.language.isodan
dc.publisherCentre for Language Technology, NorS, University of Copenhagen
dc.publisherThe Danish Language Council
dc.rightsCLARIN-ACA-NC
dc.rights.labelACA
dc.rights.urihttps://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NC=1&NORED=1
dc.subjectNanotechnology
dc.titleDK-CLARIN LSP Corpus - Nanotechnology domain
dc.typecorpus
local.annotationInfo.annotationTypetokenization
local.annotationInfo.annotationTypesentence and paragraph segmentation
local.annotationInfo.annotationTypePOS-tagging
local.annotationInfo.annotationTypelemmatization
local.annotationInfo.annotationTypetermhood scoring
local.brandingCLARIN-DK
local.contact.personAdministrator CLARIN-DK info@clarin.dk Centre for Language Technology, NorS, University of Copenhagen
local.files.count12
local.files.size17739501
local.has.filesyes
local.language.nameDanish
local.size.info358144 words
local.size.info157 files
local.sponsornationalFunds n/a n/a DK-CLARIN
metashare.ResourceInfo#ContentInfo.mediaTypetext
This item isAcademic Use
and licensed under:
 Files in this item
Name
text-format.pdf
Size
111.77 KB
Format
application/pdf
Description
Documentation
MD5
c4c4b5f1cd83ff232c44bc7692621da7
Preview
  File Preview
    The file preview has not been generated yet. Please try again later or contact the system administrator
Name
README_nano.txt
Size
3.08 KB
Format
text/plain
Description
readme
MD5
f31ed057aa67441521f84c2ee986380c
Preview
  File Preview
    The file preview has not been generated yet. Please try again later or contact the system administrator
Name
teiHeader.xsd
Size
59.88 KB
Format
text/xml
Description
Schema
MD5
9fc5374ad34319278f437b963454f972
Preview
  File Preview
    The file preview has not been generated yet. Please try again later or contact the system administrator
Name
textCorpusProfile.xsd
Size
142.26 KB
Format
text/xml
Description
Schema
MD5
7d6b452b88175041133ea8020e453cd8
Preview
  File Preview
    The file preview has not been generated yet. Please try again later or contact the system administrator
Name
text-header.pdf
Size
375.79 KB
Format
application/pdf
Description
Documentation
MD5
47825d0010a398bf10ce1564da2a15f0
Preview
  File Preview
    The file preview has not been generated yet. Please try again later or contact the system administrator
Name
DKCLARIN_fagsprogligt_korpus_dokumentation_2011.pdf
Size
361.81 KB
Format
application/pdf
Description
Documentation
MD5
e1752deaa6888e2f856811c8d933e655
Preview
  File Preview
    The file preview has not been generated yet. Please try again later or contact the system administrator
Name
AktuelNaturvidenskab.zip
Size
1.38 MB
Format
application/zip
Description
Corpus
MD5
d1f1e215b8f12667b46be3b23226d238
Preview
  File Preview
    The file preview has not been generated yet. Please try again later or contact the system administrator
Name
Nano_2.zip
Size
1.61 MB
Format
application/zip
Description
Corpus
MD5
c0ce205b4a468853cb165d3e67235677
Preview
  File Preview
    The file preview has not been generated yet. Please try again later or contact the system administrator
Name
dkclarin-LSPnano-cmdi_textCorpus.xml
Size
18 KB
Format
text/xml
Description
CMDI metadata
MD5
150e361f8f007336b07edf1f58a6b235
Preview
  File Preview
    The file preview has not been generated yet. Please try again later or contact the system administrator
Name
Nano_1.zip
Size
4.52 MB
Format
application/zip
Description
Corpus
MD5
e932cc5aee49ee0acb99c285ea7f5ccc
Preview
  File Preview
    The file preview has not been generated yet. Please try again later or contact the system administrator
Name
Nano_3.zip
Size
2.56 MB
Format
application/zip
Description
Corpus
MD5
35b8eb69e2c5d537829d3d4a35b830a2
Preview
  File Preview
    The file preview has not been generated yet. Please try again later or contact the system administrator
Name
Nano_4-5-6.zip
Size
5.8 MB
Format
application/zip
Description
Corpus
MD5
8cb823b745d30e0801abea47b816e75b
Preview
  File Preview
    The file preview has not been generated yet. Please try again later or contact the system administrator