67436b5aa232c

67436b5aa36d9
1 Guest is here.
 

Topic: SS2 subtitles
Page: « 1 ... 3 [4] 5 ... 8 »
Read 41419 times  

67436b5aa411eZylonBane

67436b5aa4193
You sometimes forgot to remove the stutter, punctuation is wrong in some cases and so on.
Or I intentionally left the stutter in and the punctuation is correct.

I've already added the reaver and SHODAN subtitles and will be posting them later. Rumbler and assassin barks have no spoken words so it seems like a waste of time, but I'll probably do them eventually.
67436b5aa4499
That would be inconsistent and when I'm talking about wrong punctuation, I mean missing punctuation marks. Yes, even you are able to make mistakes.

Ok, then I don't have to do it myself. BTW, would you be interested in a script which automatically creates (non-timed) subtitle files from vlogs and emails or will you really not do them, ever?

67436b5aa4540ZylonBane

67436b5aa458d
If someone provided pre-generated sub files for the logs and emails, I can't think of any reason why I wouldn't use them.
67436b5aa4666
Ok, will send you a PM when ready (next weekend probably).

67436b5aa4709ZylonBane

67436b5aa4751
Here's the current alpha snapshot, with SHODAN and psi reaver voices added, and various other tweaks.

EDIT: Removed attachment. Current version in first post.
« Last Edit: 05. June 2017, 02:43:38 by ZylonBane »
67436b5aa4846
Thanks. I don't get it: The German version provides extra files for the verbal psi reaver sounds but all of them are the same as the english ones. Oh well.

Edit: Which program do you use for synching text and speech?
« Last Edit: 21. November 2015, 01:42:41 by Marvin »
67436b5aa49d6
Ok, that's a tad beyond my budget.

67436b5aa4c6dMooztik

67436b5aa4cbc
If someone provided pre-generated sub files for the logs and emails, I can't think of any reason why I wouldn't use them.

i ve made them for french translation
in sub and timed srt files

67436b5aa4ddaMooztik

67436b5aa4e30
i can extract -> convert english text from .str files to .srt (with generic timing)  and post them here

and convert .srt files into one .sub file

//update
ZylonBane : i formated the sub files like yours
« Last Edit: 21. November 2015, 12:32:22 by Mooztik »
67436b5aa4f82
That would be a big help, don't have to do it myself then.

I'd be grateful if you could do that for the German sounds and texts. You can get the *.str files here (the SCP version, it has the most up-to-date level0x.str files). I'm assuming, of course, you're doing this with an automated script.
« Last Edit: 21. November 2015, 15:33:56 by Marvin »

67436b5aa5016Mooztik

67436b5aa505f
ok, fixing errors in my parser and i send theses files...

67436b5aa514eMooztik

67436b5aa5197
done !

if you find any errors in the text structure, let me kown it. everything is made automaticly. i will fix it.

i've added tabulations to the sub file. then, by using column mode in ultra edit we can copy /paste text or timing to another translated SUB file with same structure.

if you have timed srt files for other subtitles file (other than emails/logs), i can generate sub files.
67436b5aa52f1
Cool. So how are you automatically determining these "generic" timings? String splitting at punctuation marks and setting a default length per character?

One probem is that the German files are encoded in OEM850 as that is the codepage Irrational used for special characters such as umlauts or sharp s (ä,ö,ü,ß,Ä,Ö,Ü). Those are missing in the German files, but that's easy to fix.

67436b5aa551cZylonBane

67436b5aa5577
Ok, that's a tad beyond my budget.
All I use it for is viewing start points and duration. Any free audio editing software would do.

67436b5aa5807Mooztik

67436b5aa5855
Cool. So how are you automatically determining these "generic" timings? String splitting at punctuation marks and setting a default length per character?

One probem is that the German files are encoded in OEM850 as that is the codepage Irrational used for special characters such as umlauts or sharp s (ä,ö,ü,ß,Ä,Ö,Ü). Those are missing in the German files, but that's easy to fix.
yes generic timing is based on poinctuation (. ? ) and lenght from number of characters * 0.53 (medium time to read a character)
but, there is no regularity in theses audiofiles. absolutly horrible if used ingame as is :D

i can change the format .i used it for french files. what format do you need ?

i'm using  'visual subsync' . it's a wysiwig video subtitle creator software. but you can use it with audio files (some tweaks to do).
67436b5aa5abd
All I use it for is viewing start points and duration. Any free audio editing software would do.
I'm using Audacity which is a little bit finnicky. Works, but could be better.

@Mooztik: Your text parser would need to translate the file to something more manageable (UTF-8, ISO 8859) first and then start reading. When I'm editing it, I can reconvert to OEM850 and everything's okay, but if you read the file with, say, UTF-8 and save it in that codepage, all special characters are garbled up forever and I would have to correct everything.
It would be best if you could fix that somehow and don't split the strings. Just put everything in one multisub, I have to go through all files manually anyway and then decide how many parts are necessary.

Visual subsync, I'll remember that, thanks.

67436b5aa5c09Mooztik

67436b5aa5c5a
php act so strangly with caracter encoding... for real it sucks.
i don't have Locale for dutch, then php do nothing with theses strings...
this is why strings keep there format to oem. i can't fix it from inside the parser. need to use Ultraedit.

when you say, 'don't split the strings'  whould you want a  'sub(text)' or a 'multisub{}' with only one line ?
67436b5aa5d57
A multisub with a timing of 00 to some arbitrary large number, 100000 or something like that.

67436b5aa5de9Mooztik

67436b5aa5e32
like that ?
67436b5aa5f2d
Exactly like that, thanks, you're a champ.  :thumb:
67436b5aa6025
Is there any way to switch to a different description within the same multisub block such as when Polito/Shodan is interrupted by Xerxes or the Many?

On a different note: How the fuck are you supposed to subtitle the Many sequences with the interlocking sentences?
« Last Edit: 28. November 2015, 20:50:44 by Marvin »

67436b5aa6190Mooztik

67436b5aa61db
Is there any way to switch to a different description within the same multisub block such as when Polito/Shodan is interrupted by Xerxes or the Many?
I think, no.
actually i removed all the description lines. most of the time we know who is speaking, and when there's more than one caracter, i write the other one uppercases. for exemple xerxes  '<<XERXES IS SPEAKING >>' . purely for aesthetical display.
but, without description, it's possible to simply put the caracter's name before the texte on a new line.

i will surely make two versions for the french subs. one corresponding to the main release by ZylonBane, and another one without any description and some more text cut timings for better french sentences timing.
67436b5aa6605
You can disable the descriptions in cam.cfg with a simple switch so there's no need to delete them from the files, more is always better in this case.


Concerning the emails in general: I've taken a lot more liberty with them than with all the other barks or conversations. In all cases except The Many's one, the emails are almost 100% infodumps. I think that the player will care more about having the information displayed in the most concise and practical manner possible, so I've purposely stuck some sentences together which are separated by a pause if they provide correlated information and I've separated lines which come immediately after each other when they have different topics.
This is easy in Polito's case but Shodan makes so many long pauses, mostly due to stuttering and echo, that her subs are almost exclusively one line per sentence. Then come the Many: They mostly taunt (or horrify) the player, so their subs are more cutscene-like, separated whenever a pause happens, even mid-sentence. Example:
Code: [Select]
multisub EM0503 {
{ time 2300 length 4400 text "Misstrauen ist die Tyrannei des Individuums." }
{ time 7600 length 4150 text "Deine eigene Gattung betrachtet dich als eine Bedrohung." }
{ time 12350 length 3050 text "Warum ermordest du unsere Einheit?" }
{ time 15900 length 3700 text "Egal. Es ist nun alles klar." }
{ time 19850 length 2600 text "Du wirst nicht mehr weiterexistieren." }
{ time 22550 length 4700 text "Es ist nur die Frage, wer dich eliminieren wird:" }
{ time 28600 length 800 text "Wir." }
{ time 29580 length 500 text "Oder." }
{ time 30400 length 1500 text "Du?" }
}
The last three words could easily belong in one sub, but you'd completely miss the dramatic effect. So, to summarize: Polito has longer subs due to infodumps, Shodan has mostly one sub per sentence and the Many has (have?) the shortest subs. This is inconsistent by design but I think fitting the respective speech pattern of the character is more important than consistency in this case. But please engage in a discussion if you feel different.

Another little problem is the interlocking speech of The Many and Shodan. Right now, the subs start when the first word of a sentence is spoken and stops at the last word, no matter if the word comes from an echo and is repeated with larger volume later. Also, especially with The Many, this leads to subs still going when the next is already starting (echo of former sentence is still going on, pre-echo of latter sentence is already beginning). However, the subtitle engine is configured per default to let subs stay longer than defined anyway (subtitles_extra_time <time>, with time set to 3000), so you would only notice if you changed that variable to something very small.

Another thing where I'm stumped is capitalization. Example:
Code: [Select]
{ time 14800 length 1700 text "Verschwinde hier... " }
{ time 17000 length 5500 text "oder wir werden dich so verwunden, wie du uns verwundet hast." }
OR:
Code: [Select]
{ time 14800 length 1700 text "Verschwinde hier... " }
{ time 17000 length 5500 text "Oder wir werden dich so verwunden, wie du uns verwundet hast." }
(same with commata, but with commata it's clearer).


Last but not least: I don't know if I'll ever come around to subbing the logs, but here's a cleaned up version of Mooztik's file without timing. Works, but ugly.
1 Guest is here.
I lost track of how many things are broken or just make no sense, or defy convention out of spite.
Contact SMF 2.0.19 | SMF © 2016, Simple Machines | Terms and Policies
FEEP
67436b5aa981f