Linguistic Data Consortium
![]() | |
Founded | 1992 |
---|---|
Headquarters | , |
Website | www |
The Linguistic Data Consortium is an open consortium of universities, companies and government research laboratories. It creates, collects and distributes speech and text databases, lexicons, and other resources for linguistics research and development purposes. The University of Pennsylvania is the LDC's host institution. The LDC was founded in 1992 with a grant from the US Defense Advanced Research Projects Agency (DARPA), and is partly supported by grant IRI-9528587 from the Information and Intelligent Systems division of the National Science Foundation.[1] The director of LDC is Mark Liberman.[2]
Part of the motivation was to support the benchmark-oriented methodology of DARPA's Human Language Technology program. Previously, John R. Pierce directed the committee that produced the ALPAC report (1966), which caused a severe decrease in funding for linguistic AI for about 10 years. Later, Charles Wayne restarted funding in speech and language in the mid-1980s. In order to avoid the criticisms from the ALPAC report, they needed a way to demonstrate objective progress, which led to the benchmark-oriented methodology. DARPA would propose specific quantifiable and testable score targets on benchmarks, and teams being funded would attempt to reach the score targets.[3][4][5]
See also
[edit]- Corpus linguistics
- Cross-Linguistic Linked Data (CLLD) – project coordinating over a dozen linguistics databases; hosted by the Max Planck Institute (Germany)
- Language Grid – a platform for language resources, operated by NPO Language Grid Association, primarily active in Asia
- Machine translation
- Natural language processing
- Speech technology
References
[edit]- ^ "About LDC". Linguistic Data Consortium. Retrieved June 18, 2024.
- ^ "Staff". Linguistic Data Consortium. Retrieved June 18, 2024.
- ^ Cieri, Christopher; Liberman, Mark; Cho, Sunghye; Strassel, Stephanie; Fiumara, James; Wright, Jonathan (2022-06). Calzolari, Nicoletta; Béchet, Frédéric; Blache, Philippe; Choukri, Khalid; Cieri, Christopher; Declerck, Thierry; Goggi, Sara; Isahara, Hitoshi; Maegaard, Bente (eds.). "Reflections on 30 Years of Language Resource Development and Sharing". Proceedings of the Thirteenth Language Resources and Evaluation Conference. Marseille, France: European Language Resources Association: 543–550.
{{cite journal}}
: Check date values in:|date=
(help) - ^ Church, Kenneth Ward (2018-01). "Emerging trends: A tribute to Charles Wayne". Natural Language Engineering. 24 (1): 155–160. doi:10.1017/S1351324917000389. ISSN 1351-3249.
{{cite journal}}
: Check date values in:|date=
(help) - ^ Liberman, Mark; Wayne, Charles (2020-06). "Human Language Technology". AI Magazine. 41 (2): 22–35. doi:10.1609/aimag.v41i2.5297. ISSN 0738-4602.
{{cite journal}}
: Check date values in:|date=
(help)