#
# This is a non-breaking prefix list for the Swedish language.
# The file is used for sentence tokenization (text -> sentence splitting).
#
# The file is home-made by a programmer (not a linguist) who doesn't even speak Swedish so it surely can be improved.
#

# Anything in this file, followed by a period (and an upper-case word), does NOT
# indicate an end-of-sentence marker.
# Special cases are included for prefixes that ONLY appear before 0-9 numbers.

# Any single upper case letter followed by a period is not a sentence ender
# (excluding I occasionally, but we leave it in).
# Usually upper case letters are initials in a name.
A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
Q
R
S
T
U
V
W
X
Y
Z

# Usually upper case letters are initials in a name (Slovene alphabet)
Č
Š
Ž
Ć
Đ
Ä
Ë
Ö
Ü

# Roman Numerals
I
II
III
IV
V
VI
VII
VIII
IX
X
XI
XII
XIII
XIV
XV
XVI
XVII
XVIII
XIX
XX

# English -- but these work globally for all languages
Mr
Mrs
No
pp
St
no
Sr
Jr
Bros
etc
vs
esp
Fig
fig
Jan
Feb
Mar
Apr
Jun
Jul
Aug
Sep
Sept
Oct
Okt
Nov
Dec
Ph.D
PhD
# in "et al."
al
cf
Inc
Ms
Gen
Sen
Prof
Dr
Corp
Co

# Ante Christum Natum
a.C.n

# http://en.wiktionary.org/wiki/Category:Swedish_abbreviations
ack
adj
adv
amer
anat
anv
arab
aram
arkeol
arkit
astr
bankv
bet
betyd
bibl
bildl
biol
bl.a
bokf
boktr
bot
d
d.v.s
d.y
d.ä
da
data
dets
dial
dim
Dr
dvs
e.d
e.dyl
e.Kr
e. Kr
e.m
eg
ekon
el
eng
etc
ev
ex
exkl
f
f.d
f.Kr
f. Kr
f.m
f.v.t
fam
fem
fig
fil
filos
fonet
forneng
fornfra
fornhögty
fr
fr.o.m
fra
fsv
fys
förk
geogr
geol
geom
germ
got
grek
hand
hebr
hist
holl
ibl
imperf
inf
ink
inkl
inst
interj
it
jap
jmf
jur
kem
kl
komp
konst
l
lat
litt
log
m.fl
m.m
mask
mat
med
medeleng
medelholl
medelhögty
medellågty
medeltidslat
meteor
mil
miner
mus
myt
N.N
neds
neutr
no
nr
o.d
o.dyl
o.s.v
oböjl
omkr
osv
p.g.a
p.m.s
p.s.s
part
pedag
perf.part
pers
plur
polit
port
prep
pres.part
pron
psykol
real
resp
runsv
ry
s.a.s
s.k
s.ö
senlat
sing
sjö
skämts
sl
spa
sport
språkv
subst
särsk
t
t. ex
t.ex
t.o.m
tekn
teol
tex
tr
ty
v.t
vanl
vard
vers
vulgärlat
y
zool
äv
åld.

# http://norwegianlanguage.info/grammar/abbrev.html ("norwegianlanguage.info" but lists Swedish abbreviations too)
allm
ang
bl.a
ca
d.s
d.v.s
d.y
d.ä
eg
e
e.Kr
e. Kr
el
ev
f
f.d
f
f.Kr
f. Kr
fr.o.m
frk
följ
g
ggr
hr
i.st.f
jfr
kung
kl
m
m.a.o
m.fl
m.h.t
m.m
N.N
nr
o.d
omkr
o.s.v
s
s.k
t.ex
t.h
t.o.m
t.v
utg
vanl

# Number indicators
# add #NUMERIC_ONLY# after the word if it should ONLY be non-breaking when a
# 0-9 digit follows it
Nej
nej
