The Old Welsh Revitalisation
Framework
(OWRF)
A Phonological Regression and Archaic Restoration Methodology for the Revitalisation of Old Welsh within the BCDRS
Author: Penrith Beacon Communications | PBC Version: 1.0 · 2026 Applied to: Revitalised Cumbric™ dataset (307 entries across 24 lexical and grammatical domains) Data maintained in: Polyglot™ · polyglot.kingarthursroundtable.com
Abstract

The Old Welsh Revitalisation Framework (OWRF) is a phonological regression and archaic restoration methodology for revitalising Old Welsh (c. 800–1150 CE) as a fully operational language state within the Brittonic Convergent Diachronic Revitalisation System (BCDRS). Where the MWRF operates from corpus abundance (~2.8 million words), the OWRF operates from severe corpus scarcity — the entire Old Welsh textual record amounts to only a few thousand words of continuous or semi-continuous prose.

The framework applies systematic phonological regression from the Revitalised Middle Welsh (wlm) input column, using documented sound-change correspondences between Old and Middle Welsh established by Strachan, Willis, and Jackson. The O1 → O2 → O3 pipeline produces the revitalised Old Welsh (owl) column which feeds into the Northern Brittonic Toponymic Revitalisation Framework (NBTRF). The system is governed by PBC's peer-review protocol.

The OWRF is the critical bridge in the BCDRS chain: it converts the relatively abundant Middle Welsh forms into their archaic Old Welsh antecedents, from which the NBTRF can then apply northern Brittonic divergence modelling. The methodological challenge of the OWRF is the reverse of the MWRF's: not selection among abundant evidence, but principled reconstruction from extreme scarcity, constrained by the hard discipline of working only backwards along documented sound-change paths.

§1

Background and Historical Context

Old Welsh — the earliest attested form of the Welsh language — was spoken and written between approximately 800 and 1150 CE. It represents the phase of the language immediately following the fragmentation of the Brittonic dialects of post-Roman Britain into recognisably distinct regional forms, and immediately preceding the classical Middle Welsh period with its rich literary and legal corpus. Old Welsh occupies a position of pivotal importance in the history of the Brittonic languages: it is the form from which both Middle Welsh and Cumbric directly descended, and its phonological and morphological structure is therefore the gateway through which the BCDRS pipeline passes from Middle Welsh to the northern Brittonic dialects.

The great challenge of Old Welsh linguistics is, simply, the scarcity of the surviving corpus. Unlike Middle Welsh — which offers 2.8 million words of digitised prose — the complete Old Welsh textual record amounts to only a few thousand words, preserved in forms that are fragmentary, mixed with Latin, or embedded in later manuscript traditions [1]. This is not the scarcity of a language that failed to produce literature; it is the scarcity of a language whose literature, if it existed, has not survived. The evidence that does survive is preserved almost entirely in contexts of scholarly and ecclesiastical Latin writing — as glosses in the margins of Latin manuscripts, as computational and calendrical notes, and as brief legal and memorial inscriptions.

The principal primary sources are:

  • The Computus Fragment (Cambridge MS Add. 4543, c. 920 CE): the most substantial surviving Old Welsh prose text — a brief computation relating to calendrical reckoning, providing a few hundred words of continuous Welsh prose. It is the touchstone of Old Welsh lexicography and the primary attestation source for the OWRF [2].
  • The Juvencus poems (Cambridge MS Ff.4.42, c. 900 CE): nine Old Welsh englyn stanzas, providing verse forms and lexical evidence not found in the prose sources [3].
  • The Lichfield Gospels (Lichfield Cathedral MS, c. 800 CE): contains the Surrexit memorandum, an Old Welsh legal and memorial text of approximately 120 words — one of the earliest surviving Welsh prose texts, providing pronominal, prepositional, and formulaic evidence [4].
  • The Cadfan Stone (Tywyn, Gwynedd, c. 7th–8th century): a memorial stone inscription providing early Brittonic onomastic evidence [5].
  • Manuscript glosses: scattered Old Welsh words and phrases glossing Latin text in various manuscripts, documented systematically by Strachan and more recently by Sims-Williams and others [1, 6].

These sources together provide a corpus adequate to establish the principal features of Old Welsh phonology, morphology, and syntax — and to support the O1 attestation tier for a small number of high-frequency grammatical forms. However, for the majority of the 307 dataset rows, no direct Old Welsh attestation is available. This makes the OWRF's O2 phonological regression stage the workhorse of the framework.

Old Welsh is linguistically close to Middle Welsh — closer, in most respects, than Middle Welsh is to Modern Welsh. The most significant differences are phonological and orthographic rather than structural: Old Welsh preserves several consonantal distinctions that were lost or merged in Middle Welsh, and its scribal conventions reflect Latin letter-habits that were gradually replaced by more distinctively Welsh orthographic practices as the Middle Welsh period developed. The OWRF's task is to systematically reverse those transitions — to work back from the Middle Welsh forms produced by the MWRF to their Old Welsh antecedents.

§2

Theoretical Foundations

2.1 Phonological Regression as Primary Method

Phonological regression — the principled reconstruction of an earlier phonological form from a later one, using documented sound-change correspondences — is the established methodology of historical linguistics. It is what linguists mean by "reconstruction": not speculation about what a language might have sounded like, but the systematic reversal of known, documented, and independently verified sound changes [7].

The specific sound changes between Old Welsh and Middle Welsh are not hypothetical. They are documented in the major works of Celtic historical linguistics — above all in Kenneth Jackson's Language and History in Early Britain [7], which provides the most comprehensive account of Brittonic phonological history from proto-Celtic through to the medieval Welsh and Cornish periods. Jackson's work establishes, for each documented sound change, the approximate date, the phonological environment, the evidential basis, and the comparative Brittonic context. The OWRF's O2 rules are derived directly from Jackson's documented correspondences and from the more recent syntheses provided by Willis [8] and Sims-Williams [6].

The phonological regression methodology has a clear discipline: it may only apply documented rules in documented environments. It may not apply a rule speculatively in an environment where the rule is not established. And it must default to Middle Welsh baseline retention (O3) wherever the rule application is ambiguous, the environment is unclear, or the evidence for the sound change is insufficient. This discipline is the OWRF's analogue of the NBTRF's Uncertainty Rule.

2.2 The Computus Fragment and Primary Attestation

The Computus Fragment — preserved in Cambridge MS Add. 4543 and dated to approximately 920 CE — is the primary attestation source for the OWRF [2]. It provides directly attested Old Welsh forms for several grammatical items of relevance to the dataset: verbal forms of bod, prepositional forms, pronouns, and a handful of temporal nouns. These attested forms yield Grade A outputs at the O1 stage and provide the closest thing to direct Old Welsh evidence available in the corpus.

The Computus Fragment is not merely a lexical source. Its orthographic conventions — the use of single ⟨d⟩ for /ð/, the use of ⟨p⟩ for /f/ in certain positions, the specific spelling of pronominal and prepositional forms — provide the authoritative model for the OWRF's orthographic normalisation. Where the Computus spells a form in a particular way, that spelling is adopted as the standard OWRF form for that item.

Strachan's edition of the Old Welsh texts, published in 1909 [1], remains the standard critical edition for these materials. Willis's more recent synthesis [8] provides updated linguistic analysis but does not supersede Strachan as an edition of the primary sources.

2.3 Comparative Brittonic Evidence

The Old Welsh corpus is too small to provide evidence for all the forms required by the 307-entry dataset. The OWRF therefore draws systematically on comparative Brittonic evidence — principally from early Cornish and Breton, which share the Old Welsh period as their common ancestor — to support O2 regression where the Old Welsh sources are silent [9, 10].

Koch's work on Brittonic comparative linguistics [11] and James's Brittonic Language in the Old North (BLITON) [12] provide additional evidence from the northern Brittonic record that is directly relevant to the OWRF's task. The northern Brittonic zone — the geographical origin of Cumbric — was conservative in its phonological development, which means that Old Welsh forms representing the general Brittonic state of the period are likely to be a reliable approximation of the northern forms that Cumbric inherited.

2.4 Archaic Morphological Restoration

Beyond phonological regression, the OWRF applies archaic morphological restoration in selected paradigm cells — most significantly in the verbal system, where Strachan's paradigm tables document Old Welsh synthetic inflections that were subsequently lost or replaced by periphrastic constructions in Middle Welsh [1].

The most important case is the present tense 3sg of bod (to be): Middle Welsh yw corresponds to Old Welsh iss/is, the latter form being directly attested in the Computus and in manuscript glosses. Similarly, the Old Welsh verbal noun of the verb "to go" is mynet (Middle Welsh mynd) — a form attested in Old Welsh glosses and providing one of the clearest documented divergences between the two periods. These restoration cases are relatively few in number but are among the highest-confidence outputs the OWRF produces.

§3

The Evidence Hierarchy Model

The OWRF operates a strict hierarchical evidence model. The fundamental asymmetry with the MWRF is that Grade A (direct attestation) is achievable for a much smaller proportion of entries — perhaps 5–15% of the dataset rather than the 55–65% achievable at M1 in the MWRF. The OWRF's default outcome is Grade B (phonological regression) rather than Grade A, and Grade C (Middle Welsh baseline retention) is more frequently applied than in the MWRF.

LevelSourceMethodGrade
O1-ComputusComputus Fragment (Cambridge MS Add. 4543)Direct attestation of Old Welsh formA
O1-LichfieldLichfield Gospels (Surrexit memorandum)Direct attestation; early formulaic and legal WelshA
O1-JuvencusJuvencus poemsDirect attestation; OW verse formsA
O1-GlossesOld Welsh manuscript glosses, per Strachan/WillisDirect attestation; lexical and grammatical itemsA
O1-GPCGPC historical entries for Old Welsh period formsDictionary attestation with primary source citationsA
O2-JacksonJackson, Language and History in Early BritainPhonological regression per Jackson's documented correspondencesB
O2-WillisWillis, Old and Middle WelshPhonological regression per Willis's updated analysisB
O2-StrachanStrachan, Introduction to Early WelshParadigm regression per Strachan's morphological tablesB
O3-wlmRevitalised Middle Welsh columnwlm value retained; OW and MW forms identical or indistinguishableC

A crucial point about the evidence hierarchy: the O3 baseline in the OWRF is the wlm value (Revitalised Middle Welsh), not the cy value (Modern Welsh). The MWRF has already performed the work of identifying the Middle Welsh form; the OWRF takes that form as its starting point. This means that where the wlm and owl values are identical (Grade C), they may nonetheless differ from the Modern Welsh cy value — the distinction between cy and wlm having been established by the MWRF.

§4

The O1 → O2 → O3 Revitalisation Pipeline

The OWRF processes each dataset entry through a three-stage sequential pipeline. The output of each stage becomes the input to the next, and the final output of the pipeline is the value written to the owl (Old Welsh) column.

4.1 Stage O1: Direct Attestation

Sources: Computus Fragment, Lichfield Gospels, Juvencus poems, manuscript glosses (per Strachan and Willis), GPC historical entries for Old Welsh.

O1 asks: is this word or paradigm form directly attested in a surviving Old Welsh primary source? This is the most demanding question the OWRF can ask, and — given the sparseness of the Old Welsh corpus — it can be answered affirmatively for relatively few dataset entries. The items most likely to yield O1 Grade A outputs are: verbal forms of bod attested in the Computus; the verbal noun mynet attested in manuscript glosses; pronominal forms confirmed by the Lichfield Gospels; and a small number of common nouns and particles attested across multiple sources.

The O1 procedure involves four sequential checks:

  1. Consult the GPC entry for the word and check for attestations specifically labelled as Old Welsh period (c. 800–1150 CE). Note the manuscript citation and date.
  2. Check Strachan's Introduction to Early Welsh for any paradigm table that provides a form for this item. Strachan's paradigms, being based on the primary manuscript sources, yield Grade A.
  3. Consult Willis's updated synthesis for any corrections or amplifications to Strachan's forms.
  4. If none of the above provides an attested form, proceed to O2.

4.2 Stage O2: Phonological Regression from Middle Welsh

Source: wlm (Revitalised Middle Welsh) value; Jackson, Willis, Strachan for rule basis.

O2 applies systematic phonological regression to the wlm form, transforming it into its Old Welsh antecedent by reversing the documented sound changes between Old and Middle Welsh. This is the workhorse stage of the OWRF — the stage that handles the large majority of entries for which O1 attestation is unavailable.

O2 outputs are assigned Grade B. They represent academically defensible derivations from Jackson's documented correspondences — not attested forms, but principled reconstructions. The key discipline of O2 is to apply regression rules only in the environments where they are established and to default to O3 wherever the rule application is ambiguous.

4.3 Stage O3: Middle Welsh Baseline Retention

Source: wlm (Revitalised Middle Welsh) value, retained directly.

O3 is applied when O1 attestation is unavailable and O2 regression either produces the same result as the wlm form or cannot be reliably applied. In these cases, the OWRF retains the wlm value and assigns Grade C.

Grade C is applied more frequently in the OWRF than in the MWRF, because the Old Welsh corpus is far smaller. For many items — particularly in the stable categories (days of the week, months, conjunctions, greetings) — the Old Welsh and Middle Welsh forms are effectively identical, and the O2 rules do not apply. For these items, the wlm value is the best available approximation of the Old Welsh form, and it is adopted as Grade C.

§5

Stable Categories

The following categories are expected to yield Grade C outputs as their standard baseline in the OWRF. The basis for this classification differs subtly from the MWRF's stable categories: while the MWRF identifies stable categories on the grounds that MW and ModW forms are effectively identical, the OWRF identifies them on the grounds that OW and MW forms are effectively identical — which is independently true, reflecting the conservatism of these lexical categories across the Old and Middle Welsh period.

Row prefixCategoryBasis for Grade C baseline
DAY_*Days of the weekLatin-derived borrowings stable across OW and MW; attested in Computus Fragment as identical or near-identical to MW forms
MON_*MonthsLatin borrowings; stable across both periods
SEA_*SeasonsGwanwyn, Haf, Hydref, Gaeaf — stable across OW and MW
TIM_*Telling the timePost-medieval register; wlm = cy = owl for these items
TMP_*Temporal wordsStable; OW and MW forms identical or near-identical
CONJ_*ConjunctionsPrincipal Welsh conjunctions largely stable across both periods
PREP_*Prepositions (uninflected)Simple prepositions stable between OW and MW; inflected forms may differ — check Strachan for specific items
GRT_*, INT_*, POL_*Greetings / introductions / politenessPost-medieval phrasebook register; no OW equivalents; wlm = owl for these items

The key contrast with the NBTRF's immutable categories: the OWRF has no immutable categories in the sense of structural prohibitions on investigation. Grade C is an expected evidentiary baseline, not a theoretical constraint. If O1 attestation provides a genuinely different Old Welsh form for a day of the week or a conjunction, that O1 form takes precedence. Grade C reflects the state of the evidence, not a rule about what can be discovered.

However, the OWRF applies Grade C more broadly than the MWRF because the OW corpus is so sparse that even for active derivation categories, the regression rules may not apply — leaving wlm retention as the only defensible option. The OWRF practitioner must be alert to this: a row that looks like an active derivation candidate (e.g., an adjective with ⟨dd⟩ in the wlm form) may still receive Grade C if the regression environment for that specific item is unclear.

§6

Active Derivation Categories

The following categories are expected to yield Grade A or Grade B outputs through O1 attestation or O2 regression. For these categories, either the Old Welsh primary sources provide direct evidence, or the phonological regression rules apply clearly and produce a form distinguishable from the Middle Welsh baseline.

Row prefixCategoryExpected gradeMethod
BE_*bod paradigmA/BStrachan §§87–88; Computus attestation for some cells (3sg *iss*); O2 for remainder
GO_*mynd paradigmA/BOW verbal noun *mynet* attested in glosses (Grade A); finite forms per Strachan §§101–102 (Grade B)
PRN_*PronounsA/BOW and MW pronouns largely identical; Lichfield and Computus confirm key forms (Grade A)
ADJ_*Adjectives with ⟨dd⟩ or ⟨ff⟩BO2 Rules 4a/4b/7: ⟨dd⟩ → ⟨d⟩; ⟨ff⟩ → ⟨p⟩ or ⟨f⟩
HAVE_*cael/caffael paradigmB/CO2 orthographic rules where applicable; wlm baseline for cells not in Strachan
COME_*dyfod paradigmB/CO2 orthographic/phonological rules; Willis provides some OW finite forms
NUM_*Cardinal numbersB/COW number system largely MW-identical; GPC confirms where attested
ORD_*Ordinal numbersB/COW ordinal endings per Strachan where documented
§7

The Confidence and Grade Classification System

All OWRF outputs carry one of three confidence grades:

GradeDefinitionTypical basisATTESTATION_CLASS
ADirectly attested in an Old Welsh primary source (Computus, Lichfield, Juvencus, glosses) as documented by Strachan, Willis, or GPCComputus verbal form; Lichfield pronominal form; GPC Old Welsh entry with manuscript citationDIRECT_ATTESTATION
BSystematically derived by O2 phonological regression from the wlm form, using Jackson's documented sound correspondences⟨dd⟩ → ⟨d⟩ orthographic conversion; /g/ → /ɣ/ soft mutation restoration; synthetic verb ending from StrachanPHONOLOGICAL_REGRESSION
CMiddle Welsh baseline retained; OW form identical to wlm or insufficiently evidenced for regressionStable category; O2 rules do not apply to this item; OW and MW forms confirmed identical by sourcesWLM_BASELINE

The grade distribution in the OWRF is expected to show a much higher proportion of Grade C than the MWRF, reflecting the sparseness of the Old Welsh corpus. Where the MWRF achieves Grade A for approximately 55–65% of entries through Evans and GPC, the OWRF achieves Grade A for perhaps 5–15% — primarily verb forms of bod and mynd, and a small number of pronominal and prepositional forms confirmed by the primary sources.

The critical implication for the downstream NBTRF: the NBTRF takes the owl column as its primary input. For the majority of owl cells, the value will be Grade B or C — a phonologically regressed or wlm-retained form rather than a directly attested Old Welsh form. This does not undermine the NBTRF's outputs; the identity principle (xcb = owl for most entries) means that the owl column's Grade B and C values are inherited into the xcb column with explicit confidence documentation, maintaining full traceability.

§8

Key Phonological Rules: Middle Welsh to Old Welsh

Section §8 is the most technically specific section of the OWRF dissertation. The rules below represent the documented sound-change correspondences between Middle Welsh and Old Welsh, applied in reverse (regression direction) from wlm to owl. Each rule is traceable to Jackson (1953), Willis (2009), or Strachan (1909).

Rule 1 — The Critical Rule: Soft Mutation /g/ → /ɣ/

This is the most diagnostically important difference between Old Welsh and Middle Welsh, and it is the rule with the most significant practical effect on the OWRF dataset.

In Modern Welsh and Middle Welsh, the soft mutation (lenition) of an initial /g/ produces zero — the /g/ is deleted entirely, leaving no consonant. This Ø outcome is uniform across both periods. In Old Welsh, however, the corresponding lenited consonant was /ɣ/ — the voiced velar fricative — the phoneme that stands between the /g/ of unlenited position and the zero of Middle Welsh lenited position [7, §§25–30].

The historical development is: Proto-Brittonic /g/ → OW lenited /ɣ/ → MW/ModW lenited Ø (deletion). The deletion of /ɣ/ is a Middle Welsh development. Old Welsh retained the fricative.

OWRF Application: Wherever a wlm form shows Ø (zero) in a position where soft mutation of underlying /g/ is triggered — after feminine noun determiners, certain prepositions, the predicative particle yn, in the second element of compounds following a vowel-final first element — the owl form must restore initial ⟨g⟩ representing /ɣ/.

Orthographic representation: The OWRF represents Old Welsh /ɣ/ as ⟨g⟩, following the convention of the primary sources (which do not use a distinct grapheme for /ɣ/). The distinction is contextual: underlying /g/ appears in unmutated positions; /ɣ/ appears in mutated positions. Modern scholarly editions typically mark this distinction with diacritics or subscript notation, but the OWRF adopts the ⟨g⟩ convention of the manuscripts.

Middle Welsh form (wlm)Old Welsh form (owl)Notes
(y) las — mutated form of *glas* (blue/green)(y) glasOW retains /ɣ/ spelled ⟨g⟩ where MW has Ø; Jackson §28
Unmutated *glas* (blue/green)glasIdentity — unmutated /g/ unchanged in both periods
(y) wyrdd — mutated form of *gwyrdd* (green)(y) gwyrdd / (y) guirdOW ⟨u⟩ for /w/; /ɣ/ retained; Jackson §28

Rule 2 — Old Welsh Orthographic Conventions: ⟨d⟩ for /ð/ (no ⟨dd⟩)

Middle Welsh represents the voiced dental fricative /ð/ with the digraph ⟨dd⟩ — a distinctively Welsh spelling convention that emerged in the later Old Welsh / early Middle Welsh transition period. Old Welsh, following Latin scribal habits, represented /ð/ with single ⟨d⟩ [1, 8].

Application: Replace all wlm ⟨dd⟩ (representing /ð/) with ⟨d⟩ in the owl form. This is one of the most consistently applicable rules across the dataset — any wlm form containing ⟨dd⟩ requires this substitution.

Middle Welsh (wlm)Old Welsh (owl)Notes
rhudd (red)rhud⟨dd⟩ → ⟨d⟩; Strachan confirms OW orthography
oedd (was, 3sg impf.)oed⟨dd⟩ → ⟨d⟩; Computus attests oed
mynd (to go)mynetVerbal noun form differs entirely — see morphological restoration

Rule 3 — Old Welsh Orthographic Conventions: ⟨p⟩ or ⟨f⟩ for /f/ (no ⟨ff⟩)

Middle Welsh represents the voiceless labio-dental fricative /f/ with the digraph ⟨ff⟩. Old Welsh used ⟨p⟩ in many positions (following the Latin convention for the bilabial fricative derived from Latin F) or single ⟨f⟩ — but did not use ⟨ff⟩ [1]. The ⟨p⟩ convention is particularly evident in forms like OW cap for MW caff (may get) and in the verbal noun of the verb "to get": OW capaul alongside the MW form caffael.

Application: Replace wlm ⟨ff⟩ with ⟨p⟩ (dominant scribal convention) or ⟨f⟩ in the owl form. The choice between ⟨p⟩ and ⟨f⟩ depends on position and the specific item — check Strachan for individual cases. In word-initial position, ⟨f⟩ is more common; in medial and final positions, ⟨p⟩ appears frequently.

Rule 4 — ⟨c⟩ for /k/ (Identity)

Both Old and Middle Welsh represent /k/ with ⟨c⟩. This is an identity rule — no modification required. The use of ⟨c⟩ for /k/ is a conservative Latin-derived convention retained across both periods.

Rule 5 — ⟨t⟩ for /θ/ in Some Positions (Caution)

In some Old Welsh scribal sources, the voiceless dental fricative /θ/ (Middle Welsh ⟨th⟩) is represented by ⟨t⟩, particularly in word-final position. This is inconsistently applied across Old Welsh manuscripts and reflects the Latin scribal convention of using ⟨t⟩ for dental sounds. The OWRF applies this rule with caution: only in cases where Strachan or Willis specifically documents the ⟨t⟩ spelling for a particular item.

Rule 6 — Pronominal Forms (Largely Identity)

Old Welsh and Middle Welsh pronoun forms are largely identical, making this one of the most conservative areas of the grammatical system. The OWRF applies identity for most pronominal rows. The 3pl demonstrates slight variation: both hwy and wynt are found in Old Welsh sources; the OWRF uses hwy as the standard form, following the wlm baseline set by the MWRF.

PersonMiddle Welsh (wlm)Old Welsh (owl)Notes
1sgmimiIdentity; Lichfield confirms
2sgtitiIdentity
3sg mascefefIdentity
3sg femhihiIdentity
1plniniIdentity
2plchwichwiIdentity
3plhwyhwy / wyntBoth attested in OW; hwy used as standard

Rule 7 — Synthetic Verbal Inflections

Old Welsh preserves more synthetic verbal inflections than Middle Welsh, particularly in the paradigm of bod (to be). Key differences documented in Strachan's tables:

Tense / PersonMiddle Welsh (wlm)Old Welsh (owl)Source
Present 3sgywiss / isStrachan §87; Computus attestation (Grade A)
Present 1sgwyfbim / bofStrachan §87 (Grade B)
Imperfect 3sgoeddoedStrachan §88; glosses (Grade A)
Verbal noun (to go)myndmynetGlosses; Jackson §105 (Grade A)

Rule 8 — Definite Article and Prepositions (Identity)

The Old Welsh definite article y/yr is identical to the Middle Welsh form. The major prepositions are largely identical between Old and Middle Welsh. Where Strachan documents a divergent Old Welsh prepositional form, it is applied; otherwise, the OWRF applies identity for prepositions (wlm = owl).

§9

Analytical Results: Application to 307 Entries

The OWRF pipeline has been applied to all 307 entries in the Revitalised Cumbric dataset. The distribution of outcomes reflects the evidentiary conditions of the Old Welsh corpus: much higher Grade C proportions than the MWRF, with Grade A confined to a small number of high-frequency grammatical forms.

StageExpected proportionNotes
O1 — Grade A (Direct Attestation)~5–15% of entriesVerb forms of bod (Computus); verbal noun mynet; key pronominal and prepositional forms
O2 — Grade B (Phonological Regression)~30–45% of entriesAdjectives with ⟨dd⟩/⟨ff⟩; verb paradigm cells in Strachan; items where /g/ → /ɣ/ rule applies
O3 — Grade C (wlm Baseline)~45–60% of entriesStable categories; items where OW and MW are identical; regression rules do not apply

The high Grade C proportion in the OWRF is not a weakness of the framework — it is the correct scholarly outcome given the evidentiary conditions. The OWRF never fabricates a Grade A or B outcome where the evidence does not support it. A Grade C entry (owl = wlm) is a high-confidence claim: it asserts that the Old Welsh form is the same as the Middle Welsh form for this item, which is itself a substantive scholarly judgement grounded in the evidence hierarchy.

Key divergences from wlm in the owl column

DomainExpected owl/wlm relationshipRule applied
bod present 3sgDivergent: owl iss, wlm ywO1 — Computus attestation (Grade A)
mynd verbal nounDivergent: owl mynet, wlm myndO1 — gloss attestation (Grade A)
Adjectives with ⟨dd⟩Divergent: owl ⟨d⟩, wlm ⟨dd⟩O2 Rule 2 (Grade B)
Forms with ⟨ff⟩Divergent: owl ⟨p⟩ or ⟨f⟩, wlm ⟨ff⟩O2 Rule 3 (Grade B)
Soft-mutated /g/ formsDivergent: owl initial ⟨g⟩, wlm ØO2 Rule 1 (Grade B)
Days, months, seasonsIdentity: owl = wlmO3 (Grade C)
Most pronounsIdentity: owl = wlmO1 (Grade A); forms identical
§10

Reliability Constraints and Limitations

The following limitations apply to all OWRF outputs:

  1. Corpus scarcity as the fundamental constraint. The Old Welsh corpus is so small that direct attestation (Grade A) is achievable for a small minority of dataset entries. This means that the majority of owl values rest on phonological regression (Grade B) or Middle Welsh baseline retention (Grade C) rather than on direct evidence. This is a fundamental constraint of the data that no methodology can overcome.
  2. Manuscript variation in Old Welsh sources. The Old Welsh primary sources show considerable orthographic variation, reflecting different scribes, different manuscript traditions, and the strong influence of Latin scribal habits. The OWRF normalises to the conventions documented by Strachan and Willis, but this normalisation involves scholarly judgement.
  3. Regression direction ambiguity. Some sound changes between Old and Middle Welsh are environmentally complex. In ambiguous environments, the OWRF defaults to Grade C (wlm retention). Future scholarship may clarify these ambiguities and enable Grade B regression where Grade C is currently applied.
  4. Downstream NBTRF dependence. For the 282 identity rows in the NBTRF (xcb = owl), the quality of the Cumbric output is bounded by the quality of the owl column. The owl column's Grade B and C proportions mean that the overall confidence of the NBTRF output is constrained by the OWRF's evidentiary position.
  5. OW/MW transition period uncertainty. The 800–1150 CE period is not internally homogeneous. Forms from 800 CE may differ from forms from 1100 CE in ways that the OWRF cannot fully capture. The framework targets the general Old Welsh period rather than any specific date within it.
  6. Human expert verification recommended. The entire owl column should ideally be reviewed by specialists in Old Welsh linguistics. Departments of Celtic Studies at Aberystwyth, Bangor, Cardiff, or Edinburgh would be appropriate reviewing bodies; the School of Celtic Studies at the Dublin Institute for Advanced Studies is particularly appropriate given its role in publishing the standard grammars.
§11

The OWRF within the BCDRS: Between MWRF and NBTRF

The OWRF occupies the middle position in the medial pipeline of the Brittonic Convergent Diachronic Revitalisation System (BCDRS). It receives input from the MWRF and passes its output to the NBTRF. This position is methodologically pivotal: the OWRF is the bridge between the period of Welsh linguistic abundance and the period of near-total Cumbric absence.

Modern Welsh (cy)MWRF → Revitalised Middle Welsh (wlm) → OWRF → Revitalised Old Welsh (owl) → NBTRF → Revitalised Cumbric (xcb)

The OWRF's role in the BCDRS is the conversion operation: it takes the Middle Welsh forms produced by the MWRF and transforms them into Old Welsh antecedents. This matters for the NBTRF because Cumbric was not a dialect of Middle Welsh — it was a dialect of Old Welsh and Late Brittonic. The relationship between Cumbric and the southern Brittonic tradition runs through the Old Welsh period, not the Middle Welsh period. Without the OWRF, the NBTRF would be deriving Cumbric from Middle Welsh forms — forms that already reflect sound changes (the loss of /ɣ/, the development of ⟨dd⟩ and ⟨ff⟩ digraphs, and other Middle Welsh innovations) that post-date the Cumbric period. The OWRF removes these Middle Welsh innovations, restoring the Old Welsh baseline that the NBTRF can legitimately use as the Cumbric starting point.

In one important sense, the OWRF is the most theoretically significant of the three BCDRS frameworks. The MWRF operates in conditions of abundance — its primary challenge is selection. The NBTRF operates under extreme evidentiary constraint — its primary challenge is the discipline of not speculating. The OWRF operates at the inflection point between these two conditions: it applies the tools of historical linguistics to move from abundance to reconstruction, using documented sound changes as the only legitimate path.

The relationship between the OWRF and the NBTRF is one of strict dependency. The NBTRF treats the owl column as its authoritative baseline for the 282 identity entries (xcb = owl). Any improvement to the owl column — any correction of a Grade C entry to Grade B based on new evidence, any correction of a Grade B entry to Grade A based on new attestation — flows automatically through to the xcb column. The integrated nature of the BCDRS pipeline means that improvements anywhere in the chain improve the final output.

For the Cumbric revitalisation project specifically, the OWRF's most significant contribution is the restoration of the /g/ → /ɣ/ distinction in soft mutation environments. This distinction is invisible in the Middle Welsh baseline but is real in Old Welsh — and since Cumbric descends from Old Welsh, the distinction was present in Cumbric in the pre-decay period. Whether the NBTRF should reproduce the /ɣ/ in the xcb column is a question for the NBTRF specification — but the OWRF ensures that the evidence for its existence is properly represented in the owl baseline.

§12

Future Development and Contribution Protocol

The OWRF is designed as a living framework. Old Welsh scholarship, though a specialised field, is active: new critical editions of Old Welsh texts, new analyses of manuscript glosses, advances in comparative Brittonic phonological history, and the growing digitisation of medieval manuscripts all offer prospects for improving the quality of the owl column over time.

Sims-Williams's ongoing work on early Welsh manuscripts [6] is particularly relevant to the OWRF's O1 attestation tier — his systematic cataloguing of Old Welsh manuscript glosses provides the most comprehensive contemporary survey of the primary sources. New editions of early Welsh texts produced by the University of Wales Centre for Advanced Welsh and Celtic Studies continue to expand the edited corpus.

The most significant future improvement to the OWRF would be the expansion of the O1 Grade A tier through new attestation discoveries. Each new Old Welsh gloss or text fragment has the potential to confirm or refine an owl value currently assigned Grade B or C. The framework is designed to accommodate such improvements without requiring structural change — a new O1 finding simply updates the grade and source for the relevant entry in the trace file and the dataset.

External contributions from qualified specialists in Old Welsh linguistics are welcomed under the formal contribution protocol documented at the Contributors page. Contributions relating to the OWRF are held to the same standard as NBTRF contributions: dissertation-format submissions, explicit evidence documentation, and graded confidence assignments. The field expertise required is particularly narrow — the OWRF sits at the intersection of Old Welsh philology and Brittonic historical phonology — and specialist review is especially important for this framework.

The conservative standard is permanent throughout the BCDRS. The OWRF never fabricates Grade A or B outputs where the evidence does not exist. Each Grade C entry is a considered scholarly position, not a gap to be filled by speculation. The goal is not to minimise the number of Grade C entries — it is to ensure that every grade assignment is accurate, every source is documented, and every correction is traceable.

References

References

  • Strachan, John. Introduction to Early Welsh. Manchester: Manchester University Press, 1909. [Foundational edition and grammatical description of Old Welsh; primary source for OWRF paradigm tables and orthographic conventions.]
  • Jones, Mary, ed. and trans. Computus Fragment (Cambridge MS Add. 4543). Online transcription. maryjones.us/ctexts/computus.html. [Primary attested Old Welsh text; source for verbal forms of bod, temporal vocabulary, and Old Welsh orthographic conventions.]
  • Rowland, Jenny, ed. Early Welsh Saga Poetry: A Study and Edition of the Englynion. Cambridge: D. S. Brewer, 1990. [Includes the Juvencus poems among other early Welsh verse; standard edition for OW verse forms.]
  • Davies, Wendy, ed. The Llandaff Charters. Aberystwyth: National Library of Wales, 1979. [Includes analysis of early Welsh documentary texts including the Lichfield Gospels memoranda; standard reference for OW legal and memorial Welsh.]
  • Nash-Williams, V. E. The Early Christian Monuments of Wales. Cardiff: University of Wales Press, 1950. [Includes the Cadfan Stone inscription and other early Brittonic epigraphic evidence.]
  • Sims-Williams, Patrick. The Celtic Inscriptions of Britain: Phonology and Chronology. Oxford: Blackwell, 2003. [Standard reference for early Brittonic epigraphy and phonological history; used for comparative evidence in OWRF O2 derivation.]
  • Jackson, Kenneth Hurlstone. Language and History in Early Britain: A Chronological Survey of the Brittonic Languages, First to Twelfth Century AD. Edinburgh: Edinburgh University Press, 1953. Reprinted Dublin: Four Courts Press, 1994. [The definitive account of Brittonic phonological history; primary academic basis for all OWRF O2 phonological regression rules.]
  • Willis, David. "Old and Middle Welsh." In The Celtic Languages, ed. Martin J. Ball and Nicole Müller. 2nd ed. London: Routledge, 2009. [Modern standard reference for OW and MW phonology and morphology; supplements and in some respects supersedes Strachan.]
  • Koch, John T., and John Carey, eds. The Celtic Heroic Age: Literary Sources for Ancient Celtic Europe and Early Ireland and Wales. 4th ed. Aberystwyth: Celtic Studies Publications, 2003. [Provides early Brittonic and Cornish comparative context for the OWRF's phonological regression rules.]
  • Clancy, Thomas Owen, ed. The Triumph Tree: Scotland's Earliest Poetry AD 550–1350. Edinburgh: Canongate, 1998. [Anthology including Old Welsh and early Brittonic verse from the northern zone; contextualises the northern Brittonic phonological record used by the NBTRF.]
  • Koch, John T. "Cumbric." In Celtic Culture: A Historical Encyclopaedia. Santa Barbara: ABC-CLIO, 2006. [Overview of Cumbric as a northern Brittonic dialect; contextualises the OWRF's relationship to the NBTRF's Cumbric focus.]
  • James, Alan G. Brittonic Language in the Old North (BLITON). Scottish Place-Name Society, 2019 (1st ed.); 2022 (online updated ed.). spns.org.uk/bliton.html. [Comprehensive survey of Brittonic place-name evidence in northern Britain; provides the northern comparative context for OWRF outputs.]
  • Evans, D. Simon. A Grammar of Middle Welsh. Dublin: Dublin Institute for Advanced Studies, 1964. [Standard reference grammar; used by OWRF as upstream source for wlm forms and for comparison with OW paradigm cells.]
  • Williams, Ifor, ed. Canu Aneirin. Cardiff: University of Wales Press, 1938. [Primary edition of Y Gododdin; provides early Brittonic personal name evidence and some linguistic data for the Old Welsh / northern Brittonic transition period.]
  • Wikipedia contributors. "Old Welsh." Wikipedia, The Free Encyclopedia. en.wikipedia.org/wiki/Old_Welsh. [Secondary overview article; used for general context only.]
  • Wikipedia contributors. "Cumbric." Wikipedia, The Free Encyclopedia. en.wikipedia.org/wiki/Cumbric. [Secondary overview article; contextualises the OWRF's downstream relationship to Cumbric.]
  • Jenner, Henry. A Handbook of the Cornish Language. London: David Nutt, 1904. archive.org. [The founding work of Cornish revitalisation; referenced as precedent for Brittonic linguistic reconstruction methodology.]