DK-CLARIN LSP Corpus - Health domain
Please use the following text to cite this item or export to a predefined format:
Centre for Language Technology, NorS, University of Copenhagen and The Danish Language Council, 2011,
DK-CLARIN LSP Corpus - Health domain, CLARIN-DK-UCPH Centre Repository,
http://hdl.handle.net/20.500.12115/14.
Authors
Item identifier
Date issued
2011
Size
3972573 words,
3273 files
Language(s)
Description
Texts in the Health and Medicine Domain come from netpatient.dk, Søfartsstyrelsen, Sundhedsstyrelsen, regionH, Libris, Aktuel Naturvidenskab and have been collected in the DK-CLARIN project, WP2.2, 2008 - 2011.
The corpus consists of 3,972,573 words in 3273 files.
Communicative setting/Number of files: expert->expert (27) expert->advanced (40) expert->basic (3206).
All texts are in XML TEIP5 format (TEIP5DKCLARIN-format), with tokenisation, sentence and paragraph segmentation, pos-tagging, lemmatisation and termhood annotation placed in separate text external spangroups.
"DK-CLARIN LSP Corpus - Health and Medicine domain" is a part of the Danish DK-CLARIN LSP corpus consisting of seven sub-corpora from following subject domains: Agriculture, Construction, Economics, Environment, Health, IT and Nanotechnology.
Acknowledgement
n/a
Project code:n/a
Project name:DK-CLARIN
Subject(s)
Collections
Files in this item
- Name
- teiHeader.xsd
- Size
- 59.88 KB
- Format
- text/xml
- Description
- Dokumentation
- MD5
- 9fc5374ad34319278f437b963454f972

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- text-format.pdf
- Size
- 111.77 KB
- Format
- application/pdf
- Description
- Dokumentation
- MD5
- c4c4b5f1cd83ff232c44bc7692621da7

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- dkclarin-health-cmdi_textCorpus.xml
- Size
- 17.99 KB
- Format
- text/xml
- Description
- CMDI metadata
- MD5
- e0df14f4559ba3fe72ad53d86445aaa8

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- DKCLARIN_fagsprogligt_korpus_dokumentation_2011.pdf
- Size
- 361.81 KB
- Format
- application/pdf
- Description
- Dokumentation
- MD5
- e1752deaa6888e2f856811c8d933e655

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- sundhed_dk_2.zip
- Size
- 42.78 MB
- Format
- application/zip
- Description
- Corpus
- MD5
- 54adcbe6ad4842bddfa8ff1683d11afd

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- SST.zip
- Size
- 39.4 MB
- Format
- application/zip
- Description
- Corpus
- MD5
- a636edde53d1e826ba6b03423e45161c

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- text-header.pdf
- Size
- 375.79 KB
- Format
- application/pdf
- Description
- Schema
- MD5
- 47825d0010a398bf10ce1564da2a15f0

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- sundhed_dk_3.zip
- Size
- 41.55 MB
- Format
- application/zip
- Description
- Corpus
- MD5
- 6c5f9b5268c1b9e024c1c989fe6ef772

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- textCorpusProfile.xsd
- Size
- 142.26 KB
- Format
- text/xml
- Description
- Schema
- MD5
- 7d6b452b88175041133ea8020e453cd8

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- README_health.txt
- Size
- 2.96 KB
- Format
- text/plain
- Description
- readme
- MD5
- b9828b2a71471d1b9b80b9b9c68d86f1

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- AktuelNaturvidenskab.zip
- Size
- 3.22 MB
- Format
- application/zip
- Description
- Corpus
- MD5
- cfe2b468bb75de8bd5532e2896ef18c4

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- soefartsstyrelsen.zip
- Size
- 1.07 MB
- Format
- application/zip
- Description
- Corpus
- MD5
- e49627827fee177e432161a1d59df3f8

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- regionH.zip
- Size
- 12.46 MB
- Format
- application/zip
- Description
- Corpus
- MD5
- 5b5a956362c910eab028342f183f4f77

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- libris_sundhed.zip
- Size
- 18.53 MB
- Format
- application/zip
- Description
- Corpus
- MD5
- f803f4ae113ca140a1ffd744e8bbb43b

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- netpatient.zip
- Size
- 28.33 MB
- Format
- application/zip
- Description
- Corpus
- MD5
- 5c6b54208afe230115018b69e5c820f0

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk

