Anastasia Escher
Swiss Federal Institute of Technology (ETH), Nexus, Department Member
In this paper, we present a corpus for heritage Bosnian/Croatian/Montenegrin/Serbian (BCMS) spoken in German-speaking Switzerland. The corpus consists of elicited conversations between 29 second-generation speakers originating from... more
In this paper, we present a corpus for heritage Bosnian/Croatian/Montenegrin/Serbian (BCMS) spoken in German-speaking Switzerland. The corpus consists of elicited conversations between 29 second-generation speakers originating from different regions of former Yugoslavia. In total, the corpus contains 30 turn-aligned transcripts with an average length of 6 min. It is enriched with extensive speakers’ metadata, annotations, and pre-calculated corpus counts. The corpus can be accessed through an interactive corpus platform that allows for browsing, querying, and filtering, but also for creating and sharing custom annotations. Principal user groups we address with this corpus are researchers of heritage BCMS, as well as students and teachers of BCMS living in diaspora. In addition to introducing the corpus platform and the workflows we adopted to create it, we also present a case study on BCMS spoken by a pair of siblings who participated in the map task, and discuss advantages and chal...
Research Interests:
In this paper, we present a corpus for heritage Bosnian/Croatian/Montenegrin/Serbian (BCMS) spoken in German-speaking Switzerland. The corpus consists of elicited conversations between 29 second-generation speakers originating from... more
In this paper, we present a corpus for heritage Bosnian/Croatian/Montenegrin/Serbian (BCMS) spoken in German-speaking Switzerland. The corpus consists of elicited conversations between 29 second-generation speakers originating from different regions of former Yugoslavia. In total, the corpus contains 30 turn-aligned transcripts with an average length of 6 min. It is enriched with extensive speakers' metadata, annotations, and pre-calculated corpus counts. The corpus can be accessed through an interactive corpus platform that allows for browsing, querying, and filtering, but also for creating and sharing custom annotations. Principal user groups we address with this corpus are researchers of heritage BCMS, as well as students and teachers of BCMS living in diaspora. In addition to introducing the corpus platform and the workflows we adopted to create it, we also present a case study on BCMS spoken by a pair of siblings who participated in the map task, and discuss advantages and challenges of using this corpus platform for linguistic research.
Research Interests:
Research Interests:
The study deals with lexical and syntactic features of the Pope Gregory XI’s consistorial bull to the Order of Saint Paul the First Hermit, written in Avignon in 1371 and translated from Latin into Old Croatian (Čakavian). The focus is on... more
The study deals with lexical and syntactic features of the Pope Gregory XI’s consistorial bull to the Order of Saint Paul the First Hermit, written in Avignon in 1371 and translated from Latin into Old Croatian (Čakavian). The focus is on the degree of consciousness of the translator’s work, which does not imply a word for word (or even morpheme for morpheme) translation, but rather work with the semantics of the original text and careful selection of language elements. The analyzed examples demonstrate 1) variability of translation of lexemes and syntactic constructions (cf. for example, Latin persona ‘person’ and Old Croatian prelat ‘prelate’, stroitelь ‘administrator; abbot, prior, superior’ and oblastnik ‘administrator, governor’); 2) generalization of meaning of Latin lexemes (cf. for example, Latin antistes ‘bishop, abbot or prior’, episcopus ‘bishop’ and pontifex ‘pontiff’ and Old Croatian biskup ‘bishop’); 3) possibility of translating Latin tokens with phrases (cf., for exa...
Research Interests:
Research Interests:
A corpus-based method for assessing a range of dialect-standard variation is presented for identifying samples exhibiting the highest prevalence of dialect features. This method provides insight into areal and inter-speaker variation and... more
A corpus-based method for assessing a range of dialect-standard variation is presented for identifying samples exhibiting the highest prevalence of dialect features. This method provides insight into areal and inter-speaker variation and allows the extraction of maximally non-standard manifestations of the dialect, which may then be sampled and used for the study of language change and variation. The focus is on a non-standard Torlak variety, which has undergone considerable change under the influence of standard Serbian. The degree of variation is assessed by measuring the frequencies of five distinguishing linguistic features: accent position, dative reflexive si, auxiliary omission in the compound perfect, the post-positive article, and analytic case marking in the indirect object and possessive. Locations subject to the greatest and least influence of the standard are revealed using hierarchical clustering. A positive correlation between the frequencies of occurrence reveals whi...
Research Interests:
Der Beitrag beschaftigt sich mit der Etymologie des Namens des Hauptgotzen der Rugenslawen und pladiert dafur, das er ursprunglich Svantevid geheisen haben mus. Dies ergibt sich aus einer sorgfaltigen Analyse des Kultes im... more
Der Beitrag beschaftigt sich mit der Etymologie des Namens des Hauptgotzen der Rugenslawen und pladiert dafur, das er ursprunglich Svantevid geheisen haben mus. Dies ergibt sich aus einer sorgfaltigen Analyse des Kultes im indogermanischen Kontext, dessen Hauptfunktion das Weissagen ist.
Telling apart family-internal developments from contact-induced changes and socio-geographically conditioned areal trends is still one of the main challenges in tracing the development – and stability – of languages. Notably prima facie... more
Telling apart family-internal developments from contact-induced changes and socio-geographically conditioned areal trends is still one of the main challenges in tracing the development – and stability – of languages. Notably prima facie contact phenomena might turn out to have resulted from the interaction of a variety of sources upon closer inspection; assessing their interaction is a crucial requirement for comprehending the dynamics of change. Based on the example of simplification patterns in the system of short personal pronouns observed in the Balkan Slavic dialects located in the territory of North Macedonian and Greece, the present paper illustrates the necessity of identifying the multiplicity of triggers and their interaction, thereby making a case for the relevance of small-scale, transient patterns in understanding diachronic processes. In particular, it places the simplifications within the pronominal system into the larger context of differential object indexing by pre...
Research Interests:
The supporting information gives additional details on the related work and the methods and provides the full experimental results for the simulation study and the two case studies.
Summary Idioms of the Torlak dialect (spoken in southeast Serbia and western Bulgaria) are known for their “double affiliation”. On the one hand, by virtue of their historical and phonetic features, they belong to the western range of the... more
Summary Idioms of the Torlak dialect (spoken in southeast Serbia and western Bulgaria) are known for their “double affiliation”. On the one hand, by virtue of their historical and phonetic features, they belong to the western range of the South Slavic dialectic continuum. On the other hand, according to their morphosyntactic characteristics (the presence of the post-positive article, the reduced case system, etc.), they adhere to the eastern range (i. e. Balkan Slavic). This paper views the innovative features of Torlak syntax from a strictly synchronic perspective and as a phenomenon of double (i. e. both head- and dependent-) argument marking. It is argued that cases of double argument marking in Torlak appear when several conditions are met. In order to be archaically marked with an overt relict case marker, a nominal group should either refer to the a-declension or, in case of the other declension types, assume a prominent position not only on the animacy scale but also on the s...
Research Interests:
Telling apart family-internal developments from contact-induced changes and socio-geographically conditioned areal trends is still one of the main challenges in tracing the development - and stability - of languages. Notably prima facie... more
Telling apart family-internal developments from contact-induced changes and socio-geographically conditioned areal trends is still one of the main challenges in tracing the development - and stability - of languages. Notably prima facie contact phenomena might turn out to have resulted from the interaction of a variety of sources upon closer inspection; assessing their interaction is a crucial requirement for comprehending the dynamics of change. Based on the example of simplification patterns in the system of short personal pronouns observed in the Balkan Slavic dialects located in the territory of North Macedonian and Greece, the present paper illustrates the necessity of identifying the multiplicity of triggers and their interaction, thereby making a case for the relevance of small-scale, transient patterns in understanding diachronic processes. In particular, it places the simplifications within the pronominal system into the larger context of differential object indexing by pre...
Research Interests:
When speakers of different languages interact, they are likely to influence each other: contact leaves traces in the linguistic record, which in turn can reveal geographical areas of past human interaction and migration. However, other... more
When speakers of different languages interact, they are likely to influence each other: contact leaves traces in the linguistic record, which in turn can reveal geographical areas of past human interaction and migration. However, other factors may contribute to similarities between languages. Inheritance from a shared ancestral language and universal preference for a linguistic property may both overshadow contact signals. How can we find geographical contact areas in language data, while accounting for the confounding effects of inheritance and universal preference? We present sBayes , an algorithm for Bayesian clustering in the presence of confounding effects. The algorithm learns which similarities are better explained by confounders, and which are due to contact effects. Contact areas are free to take any shape or size, but an explicit geographical prior ensures their spatial coherence. We test sBayes on simulated data and apply it in two case studies to reveal language contact ...
Research Interests:
Research Interests:
Research Interests:
Balkan linguistics, Dialectology, Endangered Languages, Serbian, Corpus Linguistics, and 12 moreSerbian linguistics, South Slavic Languages, Balkan and Slavic Languages, Bulgarian Language, Balkan Sprachbund, Balkan linguistic league, South Slavic Languages and Literatures, Macedonian language, Serbian Language and Literature, Ancient Bulgarian Language, Old Slavic/Bulgarian language, and Torlakian
Research Interests:
SummaryThe article provides a comparative analysis of three different forms of perfect in the western Macedonian dialects which are further compared with analogous forms in the Albanian and the Aromanian languages. It is found out that... more
SummaryThe article provides a comparative analysis of three different forms of perfect in the western Macedonian dialects which are further compared with analogous forms in the Albanian and the Aromanian languages. It is found out that each of the perfect forms has its own range of use: the perfect of
Research Interests:
Research Interests:
The study deals with lexical and syntactic features of the Pope Gregory XI’s consistorial bull to the Order of Saint Paul the First Hermit, written in Avignon in 1371 and translated from Latin into Old Croatian (Čakavian). The focus is on... more
The study deals with lexical and syntactic features of the Pope Gregory XI’s
consistorial bull to the Order of Saint Paul the First Hermit, written in Avignon in
1371 and translated from Latin into Old Croatian (Čakavian). The focus is on the
degree of consciousness of the translator’s work, which does not imply a word for
word (or even morpheme for morpheme) translation, but rather work with the
semantics of the original text and careful selection of language elements. The
analyzed examples demonstrate 1) variability of translation of lexemes and syntactic constructions (cf. for example, Latin persona ‘person’ and Old Croatian
prelat ‘prelate’, stroitelь ‘administrator; abbot, prior, superior’ and oblastnikь
‘administrator, governor’); 2) generalization of meaning of Latin lexemes (cf. for
example, Latin antistes ‘bishop, abbot or prior’, episcopus ‘bishop’ and pontifex
‘pontiff’ and Old Croatian biskupь ‘bishop’); 3) possibility of translating Latin
tokens with phrases (cf., for example, Latin cum usuagiis ‘with the right to use the
forest’ and Old Croatian z d’rêvi plodovitimi i neplod’nimi ‘with fruit-bearing and
non-fruit-bearing trees’). In general, the translation technique of the bull is characterized by a conscious work with the original text at the level of its semantics
and genre.
consistorial bull to the Order of Saint Paul the First Hermit, written in Avignon in
1371 and translated from Latin into Old Croatian (Čakavian). The focus is on the
degree of consciousness of the translator’s work, which does not imply a word for
word (or even morpheme for morpheme) translation, but rather work with the
semantics of the original text and careful selection of language elements. The
analyzed examples demonstrate 1) variability of translation of lexemes and syntactic constructions (cf. for example, Latin persona ‘person’ and Old Croatian
prelat ‘prelate’, stroitelь ‘administrator; abbot, prior, superior’ and oblastnikь
‘administrator, governor’); 2) generalization of meaning of Latin lexemes (cf. for
example, Latin antistes ‘bishop, abbot or prior’, episcopus ‘bishop’ and pontifex
‘pontiff’ and Old Croatian biskupь ‘bishop’); 3) possibility of translating Latin
tokens with phrases (cf., for example, Latin cum usuagiis ‘with the right to use the
forest’ and Old Croatian z d’rêvi plodovitimi i neplod’nimi ‘with fruit-bearing and
non-fruit-bearing trees’). In general, the translation technique of the bull is characterized by a conscious work with the original text at the level of its semantics
and genre.
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
В статье исследуются дискурсивные стратегии билингвальных носителей трех находящихся в контакте диалектов: македонского, арумынского и албанского говоров региона Преспа в Македонии. На основе анализа интерференции в речи билингвов... more
В статье исследуются дискурсивные стратегии билингвальных носителей трех находящихся в контакте диалектов: македонского, арумынского и албанского говоров региона Преспа в Македонии. На основе анализа интерференции в речи билингвов делается вывод о социолингвистическом результате ситуации языкового контакта.
Research Interests:
Research Interests:
This paper provides an analysis of the language situation in the villages of the multi-ethnic Prespa region (Republic of Macedonia): Krani and Arvati. The language situation in these villages appears to be especially noteworthy in... more
This paper provides an analysis of the language situation in the villages of the multi-ethnic Prespa region (Republic of Macedonia): Krani and Arvati. The language situation in these villages appears to be especially noteworthy in juxtaposition with the general language situations in Republic of Macedonia. Interestingly, the local bilingualism has no religion restrictions , so Orthodox Macedonians master Albanian language, which is unusual for other Mace-donian regions. The most significant features of the Prespa language situation are the following: 1) a relatively high degree of linguistic competence of Macedonian informants in the Albanian language; 2) the young age of second language acquisition; 3) the use of the Albanian language as a primary means of communication within the village; 4) the absence of code-mixing in the narrative.
Research Interests:
This article provides a brief overview of the system in the preteritate system of western Macedonian dialects in both the linguistic-internal and areal-typological aspect. In addition to the “simple past tenses”, Macedonian has a... more
This article provides a brief overview of the system in the preteritate
system of western Macedonian dialects in both the linguistic-internal
and areal-typological aspect.
In addition to the “simple past tenses”, Macedonian has a tripartite
series of analytical forms, showing a wide range of perfect (resultative,
experiential) and modal (evidential, admirativ, optative) values. The
grammatical tradition in Macedonian linguistics calls these forms “perfects”.
It is assumed that their appearance in the Macedonian language is
due to, among linguistic-internal reasons, the fact that Macedonian has
been in close contact with other languages of the Balkan linguistic union
(Albanian, Aromanian) and a multi-step process of mutual calquing.
Keywords: Macedonian, Aromanian, perfect, Balkan linguistic
union, calquing, linguistic convergence
system of western Macedonian dialects in both the linguistic-internal
and areal-typological aspect.
In addition to the “simple past tenses”, Macedonian has a tripartite
series of analytical forms, showing a wide range of perfect (resultative,
experiential) and modal (evidential, admirativ, optative) values. The
grammatical tradition in Macedonian linguistics calls these forms “perfects”.
It is assumed that their appearance in the Macedonian language is
due to, among linguistic-internal reasons, the fact that Macedonian has
been in close contact with other languages of the Balkan linguistic union
(Albanian, Aromanian) and a multi-step process of mutual calquing.
Keywords: Macedonian, Aromanian, perfect, Balkan linguistic
union, calquing, linguistic convergence
Research Interests:
In diesem Beitrag werden die Resultate der Feldforschung vorgestellt, die im Laufe einer Reihe von Expeditionen der Jahre 2014–2015 in der mehrsprachigen Region der Prespaseen des geographischen Gebiets Makedonien, durchgeführt wurde.... more
In diesem Beitrag werden die Resultate der Feldforschung vorgestellt,
die im Laufe einer Reihe von Expeditionen der Jahre 2014–2015 in der
mehrsprachigen Region der Prespaseen des geographischen Gebiets Makedonien,
durchgeführt wurde. Diese Forschung hatte folgende Zielsetzungen:
Am Material einer Makroregion und in einer Situation des
unmittelbaren Kontakts mehrerer dialektaler Varietäten sollten die
Ergebnisse des kontaktinduzierten Sprachwandels (die Entwicklung eines
polymorphen Systems) am Beispiel eines einzelnen Segments der Grammatik
– dem der modalen und temporalen Funktionen des Präteritums –
festgestellt und beschrieben werden. Die Forschungsergebnisse haben
gezeigt, daß die formalen Mittel zur Realisierung grammatischer
Bedeutungen, wie sie im semantischen Bereich des Präteritums vorliegen,
in der Mehrzahl der Fälle tatsächlich Isomorphismus aufweisen.
Abweichungen von dieser Gesetzmäßigkeit zeigen sich dort, wo die
Sprachsysteme tiefgreifende formale Diskrepanzen aufweisen. In dem
Artikel wird neben der Vorstellung der Resultate der durchgeführten
Forschungen auch die Behandlung dieser Problemstellung im Rahmen der
bestehenden Literatur diskutiert.
Keywords: balkanslavische Dialektologie, albanische Dialektologie,
balkanromanische Dialektologie, Balkansprachbund, grammatischer Isomorphismus, Präteritalsystem
die im Laufe einer Reihe von Expeditionen der Jahre 2014–2015 in der
mehrsprachigen Region der Prespaseen des geographischen Gebiets Makedonien,
durchgeführt wurde. Diese Forschung hatte folgende Zielsetzungen:
Am Material einer Makroregion und in einer Situation des
unmittelbaren Kontakts mehrerer dialektaler Varietäten sollten die
Ergebnisse des kontaktinduzierten Sprachwandels (die Entwicklung eines
polymorphen Systems) am Beispiel eines einzelnen Segments der Grammatik
– dem der modalen und temporalen Funktionen des Präteritums –
festgestellt und beschrieben werden. Die Forschungsergebnisse haben
gezeigt, daß die formalen Mittel zur Realisierung grammatischer
Bedeutungen, wie sie im semantischen Bereich des Präteritums vorliegen,
in der Mehrzahl der Fälle tatsächlich Isomorphismus aufweisen.
Abweichungen von dieser Gesetzmäßigkeit zeigen sich dort, wo die
Sprachsysteme tiefgreifende formale Diskrepanzen aufweisen. In dem
Artikel wird neben der Vorstellung der Resultate der durchgeführten
Forschungen auch die Behandlung dieser Problemstellung im Rahmen der
bestehenden Literatur diskutiert.
Keywords: balkanslavische Dialektologie, albanische Dialektologie,
balkanromanische Dialektologie, Balkansprachbund, grammatischer Isomorphismus, Präteritalsystem
Research Interests:
Conference Abstract. XLIV International Linguistic Conference, St. Petersburg State University