Please use the following text to cite this item or export to a predefined format:
Centre for Language Technology, NorS, University of Copenhagen and The Danish Language Council, 2011,
DK-CLARIN LSP Corpus - Health domain, CLARIN-DK-UCPH Centre Repository,
http://hdl.handle.net/20.500.12115/14.
| dc.creator | Olsen, Sussi |
| dc.creator | Braasch, Anna |
| dc.creator | Jakob, Halskov |
| dc.creator | Hansen, Dorte Haltrup |
| dc.date.accessioned | 2018-06-08T09:45:51Z |
| dc.date.available | 2018-06-08T09:45:51Z |
| dc.date.issued | 2011 |
| dc.description | Texts in the Health and Medicine Domain come from netpatient.dk, Søfartsstyrelsen, Sundhedsstyrelsen, regionH, Libris, Aktuel Naturvidenskab and have been collected in the DK-CLARIN project, WP2.2, 2008 - 2011. The corpus consists of 3,972,573 words in 3273 files. Communicative setting/Number of files: expert->expert (27) expert->advanced (40) expert->basic (3206). All texts are in XML TEIP5 format (TEIP5DKCLARIN-format), with tokenisation, sentence and paragraph segmentation, pos-tagging, lemmatisation and termhood annotation placed in separate text external spangroups. "DK-CLARIN LSP Corpus - Health and Medicine domain" is a part of the Danish DK-CLARIN LSP corpus consisting of seven sub-corpora from following subject domains: Agriculture, Construction, Economics, Environment, Health, IT and Nanotechnology. |
| dc.identifier.uri | http://hdl.handle.net/20.500.12115/14 |
| dc.language.iso | dan |
| dc.publisher | Centre for Language Technology, NorS, University of Copenhagen |
| dc.publisher | The Danish Language Council |
| dc.rights | CLARIN-ACA-NC |
| dc.rights.label | ACA |
| dc.rights.uri | https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NC=1&NORED=1 |
| dc.subject | Health |
| dc.title | DK-CLARIN LSP Corpus - Health domain |
| dc.type | corpus |
| local.annotationInfo.annotationType | tokenization |
| local.annotationInfo.annotationType | sentence and paragraph segmentation |
| local.annotationInfo.annotationType | POS-tagging |
| local.annotationInfo.annotationType | lemmatization |
| local.annotationInfo.annotationType | termhood scoring |
| local.branding | CLARIN-DK |
| local.contact.person | Administrator CLARIN-DK info@clarin.dk Centre for Language Technology, NorS, University of Copenhagen |
| local.files.count | 15 |
| local.files.size | 197536844 |
| local.has.files | yes |
| local.language.name | Danish |
| local.size.info | 3972573 words |
| local.size.info | 3273 files |
| local.sponsor | nationalFunds n/a n/a DK-CLARIN |
| metashare.ResourceInfo#ContentInfo.mediaType | text |
Collections
Files in this item
- Name
- teiHeader.xsd
- Size
- 59.88 KB
- Format
- text/xml
- Description
- Dokumentation
- MD5
- 9fc5374ad34319278f437b963454f972

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- text-format.pdf
- Size
- 111.77 KB
- Format
- application/pdf
- Description
- Dokumentation
- MD5
- c4c4b5f1cd83ff232c44bc7692621da7

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- dkclarin-health-cmdi_textCorpus.xml
- Size
- 17.99 KB
- Format
- text/xml
- Description
- CMDI metadata
- MD5
- e0df14f4559ba3fe72ad53d86445aaa8

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- DKCLARIN_fagsprogligt_korpus_dokumentation_2011.pdf
- Size
- 361.81 KB
- Format
- application/pdf
- Description
- Dokumentation
- MD5
- e1752deaa6888e2f856811c8d933e655

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- sundhed_dk_2.zip
- Size
- 42.78 MB
- Format
- application/zip
- Description
- Corpus
- MD5
- 54adcbe6ad4842bddfa8ff1683d11afd

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- SST.zip
- Size
- 39.4 MB
- Format
- application/zip
- Description
- Corpus
- MD5
- a636edde53d1e826ba6b03423e45161c

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- text-header.pdf
- Size
- 375.79 KB
- Format
- application/pdf
- Description
- Schema
- MD5
- 47825d0010a398bf10ce1564da2a15f0

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- sundhed_dk_3.zip
- Size
- 41.55 MB
- Format
- application/zip
- Description
- Corpus
- MD5
- 6c5f9b5268c1b9e024c1c989fe6ef772

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- textCorpusProfile.xsd
- Size
- 142.26 KB
- Format
- text/xml
- Description
- Schema
- MD5
- 7d6b452b88175041133ea8020e453cd8

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- README_health.txt
- Size
- 2.96 KB
- Format
- text/plain
- Description
- readme
- MD5
- b9828b2a71471d1b9b80b9b9c68d86f1

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- AktuelNaturvidenskab.zip
- Size
- 3.22 MB
- Format
- application/zip
- Description
- Corpus
- MD5
- cfe2b468bb75de8bd5532e2896ef18c4

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- soefartsstyrelsen.zip
- Size
- 1.07 MB
- Format
- application/zip
- Description
- Corpus
- MD5
- e49627827fee177e432161a1d59df3f8

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- regionH.zip
- Size
- 12.46 MB
- Format
- application/zip
- Description
- Corpus
- MD5
- 5b5a956362c910eab028342f183f4f77

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- libris_sundhed.zip
- Size
- 18.53 MB
- Format
- application/zip
- Description
- Corpus
- MD5
- f803f4ae113ca140a1ffd744e8bbb43b

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk
- Name
- netpatient.zip
- Size
- 28.33 MB
- Format
- application/zip
- Description
- Corpus
- MD5
- 5c6b54208afe230115018b69e5c820f0

The file preview has not been generated yet. Please try again later or contact the system administrator info@clarin.dk

