Please use the following text to cite this item:
Centre for Language Technology, NorS, University of Copenhagen and The Danish Language Council, 2011, DK-CLARIN LSP Corpus - Health domain, CLARIN-DK-UCPH Centre Repository, http://hdl.handle.net/20.500.12115/14.
dc.creator: Olsen, Sussi
dc.creator: Braasch, Anna
dc.creator: Halskov, Jakob
dc.creator: Hansen, Dorte Haltrup
dc.date.accessioned: 2018-06-08T09:45:51Z
dc.date.available: 2018-06-08T09:45:51Z
dc.date.issued: 2011
dc.description: The texts in the Health and Medicine domain come from netpatient.dk, Søfartsstyrelsen, Sundhedsstyrelsen, regionH, Libris and Aktuel Naturvidenskab, and were collected in the DK-CLARIN project, WP2.2, 2008-2011. The corpus consists of 3,972,573 words in 3,273 files. Communicative setting (number of files): expert->expert (27), expert->advanced (40), expert->basic (3,206). All texts are in XML TEI P5 format (TEIP5DKCLARIN-format), with tokenisation, sentence and paragraph segmentation, POS-tagging, lemmatisation and termhood annotation placed in separate, text-external span groups. "DK-CLARIN LSP Corpus - Health and Medicine domain" is part of the Danish DK-CLARIN LSP corpus, which consists of seven sub-corpora from the following subject domains: Agriculture, Construction, Economics, Environment, Health, IT and Nanotechnology.
dc.identifier.uri: http://hdl.handle.net/20.500.12115/14
dc.language.iso: dan
dc.publisher: Centre for Language Technology, NorS, University of Copenhagen
dc.publisher: The Danish Language Council
dc.rights: CLARIN-ACA-NC
dc.rights.label: ACA
dc.rights.uri: https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NC=1&NORED=1
dc.subject: Health
dc.title: DK-CLARIN LSP Corpus - Health domain
dc.type: corpus
local.annotationInfo.annotationType: tokenization
local.annotationInfo.annotationType: sentence and paragraph segmentation
local.annotationInfo.annotationType: POS-tagging
local.annotationInfo.annotationType: lemmatization
local.annotationInfo.annotationType: termhood scoring
local.branding: CLARIN-DK
local.contact.person: Administrator CLARIN-DK, info@clarin.dk, Centre for Language Technology, NorS, University of Copenhagen
local.files.count: 15
local.files.size: 197536844
local.has.files: yes
local.language.name: Danish
local.size.info: 3972573 words
local.size.info: 3273 files
local.sponsor: nationalFunds n/a n/a DK-CLARIN
metashare.ResourceInfo#ContentInfo.mediaType: text
This item is for Academic Use and is licensed under CLARIN-ACA-NC.
Files in this item

Name                                                 Size       Format           Description    MD5
teiHeader.xsd                                        59.88 KB   text/xml         Documentation  9fc5374ad34319278f437b963454f972
text-format.pdf                                      111.77 KB  application/pdf  Documentation  c4c4b5f1cd83ff232c44bc7692621da7
dkclarin-health-cmdi_textCorpus.xml                  17.99 KB   text/xml         CMDI metadata  e0df14f4559ba3fe72ad53d86445aaa8
DKCLARIN_fagsprogligt_korpus_dokumentation_2011.pdf  361.81 KB  application/pdf  Documentation  e1752deaa6888e2f856811c8d933e655
sundhed_dk_2.zip                                     42.78 MB   application/zip  Corpus         54adcbe6ad4842bddfa8ff1683d11afd
SST.zip                                              39.4 MB    application/zip  Corpus         a636edde53d1e826ba6b03423e45161c
text-header.pdf                                      375.79 KB  application/pdf  Schema         47825d0010a398bf10ce1564da2a15f0
sundhed_dk_3.zip                                     41.55 MB   application/zip  Corpus         6c5f9b5268c1b9e024c1c989fe6ef772
textCorpusProfile.xsd                                142.26 KB  text/xml         Schema         7d6b452b88175041133ea8020e453cd8
README_health.txt                                    2.96 KB    text/plain       readme         b9828b2a71471d1b9b80b9b9c68d86f1
AktuelNaturvidenskab.zip                             3.22 MB    application/zip  Corpus         cfe2b468bb75de8bd5532e2896ef18c4
soefartsstyrelsen.zip                                1.07 MB    application/zip  Corpus         e49627827fee177e432161a1d59df3f8
regionH.zip                                          12.46 MB   application/zip  Corpus         5b5a956362c910eab028342f183f4f77
libris_sundhed.zip                                   18.53 MB   application/zip  Corpus         f803f4ae113ca140a1ffd744e8bbb43b
netpatient.zip                                       28.33 MB   application/zip  Corpus         5c6b54208afe230115018b69e5c820f0
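
The MD5 values listed for each file can be used to check that a download arrived intact. The following is a minimal sketch, not part of the repository itself: the helper names `md5_of_file` and `verify_downloads` and the local `downloads` directory are assumptions made for illustration, and only three of the fifteen files are included to keep the example short.

```python
import hashlib
from pathlib import Path

# MD5 values copied from the file listing of this item (a subset, for brevity).
EXPECTED_MD5 = {
    "README_health.txt": "b9828b2a71471d1b9b80b9b9c68d86f1",
    "netpatient.zip": "5c6b54208afe230115018b69e5c820f0",
    "regionH.zip": "5b5a956362c910eab028342f183f4f77",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory, expected=EXPECTED_MD5):
    """Map each expected file name to 'ok', 'mismatch' or 'missing'."""
    results = {}
    for name, wanted in expected.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of_file(path) == wanted:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results


if __name__ == "__main__":
    # "downloads" is a hypothetical directory holding the downloaded files.
    for name, status in verify_downloads("downloads").items():
        print(f"{name}: {status}")
```

Reading in chunks keeps memory use low for the larger corpus archives. A "mismatch" result suggests a corrupted or incomplete download, in which case the file should be fetched again.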