tlhIngan-Hol Archive: Fri Jan 04 17:21:38 2013
[Date Prev][Date Next][Thread Prev][Thread Next]
Re: [Tlhingan-hol] Fwd: RE: Klingon Scrabble
I can, but I don't have a platform in which I can write a clever
script, so this counts each character for itself not as part of its
Klingon letter. Here's what I get, with my comments.
72543 - That's the space character, what you'd expect for a 75k
word novel.
44671 a - Our existing distribution gets that right. I wonder if
this is biased by character names. The main character is named vajar.
I'll do a version stripped of character and ship names once I have a
better system.
40793 ' - I told you there weren't enough qaghwI'mey in the
game. It beats out all but one vowel!
28488 h - This combines the letter's presence in tlh and gh, but
excludes H.
23699 o
22652 e
21140 I
20469 H - I expected this to be more common in text than in the
dictionary, because it's in -taH and -Ha' and -laH and -moH and -meH ...
20213 u - last of the vowels
19024 l - biased because this includes l and tlh
17380 t - biased by t + tlh
17291 j - interesting. One of the ship names has a j and so does
the main character's name. That might be a factor. But the main
character also has a v and and r, so I don't think so.
14365 . - Heh. Short sentences, eh?
13634 g - A combination of gh + ng
13627 m
13557 D
13455 n - includes n and ng
11737 S
11226 v
9892 q
9647 c
9530 b
9145 p
7685 y
7653 r
6611 w
5793 Q
So it looks like yay ray way and Qay should be the high-scoring
letters. Whoda thunk there were over three times as many Haymey as Qaymey.
As an indication of the cleanliness of the data, here's the rest.
5067 ,
1702 M - two alien characters, one of whom is a main character,
have names starting in M. The names of alien ships and persons is
also the explanation for most of the non-Klingon alphabetic characters below.
1122 ?
691 !
374 s
172 T
171 -
141 i
132 O
126 A
126 @ - The pIqaD 'ay' titles are typed in xifan hol, which
renders the numbers as cartoon swear words.
120 x
94 U
81 E
76 F
70 R
69 &
63 L
62 :
59 G
57 #
57 $
57 %
55 ^
54 N
51 )
51 (
47 *
43 J
41 P
40 C
39 K
39 B
35 k
30 Y
29 W
27 V
24 X
18 d
16 f
At 22:52 '?????' 1/2/2013, you wrote:
Robyn,
Could you analyze your own writings? I bet that would give a good
letter frequency representation.
Tim Stoffel
--
On Tue, 2013-01-01 at 14:09 -0800, Robyn Stewart wrote:
> That's an interesting question. Is the letter frequency distribution
> of a large piece of text different than the frequency distribution in
> a complete wordlist of that language? I think a list compiled just
> from TKD affix and vocabulary lists might competitively
> under-represent qaghwI', as it's in so many affixes.
>
> I found a shortage of qaghwI'mey during game play, but the
> artificiality of the arbitrarily high scores for tlh and ng didn't
> bother me much. It was just a luck thing.
>
> - Qov
>
> At 13:21 '?????' 1/1/2013, Felix Malmenbeck wrote:
> > At the risk of showcasing my ignorance with regards to Scrabble:
> >
> > Does one actually need a corpus to decide character values for
> > Scrabble? I imagine that a lexicon along with the rules for
> > appending affixes would suffice, as the deciding factor is what
> > words can be formed, rather than what words are most commonly used
> > (or do rare/difficult words weigh more heavily in that
> > calculation?).
> >
> >
> > ____________________________________________________________________
> > From: David Holt [[email protected]]
> > Sent: Tuesday, January 01, 2013 22:14
> > To: tlhIngan Hol mailing list
> > Subject: Re: [Tlhingan-hol] Fwd: RE: Klingon Scrabble
> >
> > > On Mon, Apr 14, 2008 at 11:57 PM, Alan Anderson
> > <[email protected]> wrote:
> > > > I got it from DloraH, who got it from janSIy, who I believe
> > originated it.
> >
> > I didn't originate it, but I may have been the first one to bring a
> > converted set to the qep'a'. I got the frequencies and values off
> > this very list and I no longer remember who did the calculations or
> > came up with the values. It was probably 15 years ago. The game is
> > fun, but the scores are somewhat artificial since the point values
> > were based on rarity of English letters and so it's weird to have
> > common letters like <tlh> be worth so many points. I think any new
> > calculations should be based on Qov's <nuq bop bom>, since that is a
> > large piece of original tlhIngan Hol writing.
> >
> > janSIy
> > _______________________________________________
> > Tlhingan-hol mailing list
> > [email protected]
> > http://stodi.digitalkingdom.org/mailman/listinfo/tlhingan-hol
> _______________________________________________
> Tlhingan-hol mailing list
> [email protected]
> http://stodi.digitalkingdom.org/mailman/listinfo/tlhingan-hol
_______________________________________________
Tlhingan-hol mailing list
[email protected]
http://stodi.digitalkingdom.org/mailman/listinfo/tlhingan-hol
_______________________________________________
Tlhingan-hol mailing list
[email protected]
http://stodi.digitalkingdom.org/mailman/listinfo/tlhingan-hol