CharSet coding in files names
Post CharSet coding in files names 
Hi again,

I have some problems with file names containing French accented characters: the file
"créa.eps" is displayed as "crÂ‚a 1.eps" and "Lumière trouble.jpg" is displayed as
"LumiÃ¨re trouble.jpg" on WinXP and Linux-Samba machines.

Does the latest beta version fix this internationalization problem? Otherwise, how
can I fix it?

We use Debian-Linux unstable 2.4.25 with fr-FR locale.

Thanks in advance.

Sam Przyswa.

--
Sam Przyswa - Chef de projet
Arial Concept - Intégrateur Internet
36, rue de Turin - 75008 - Paris - France
Tel: 01 40 54 86 04 - Fax: 01 40 54 83 01
Web: http://www.arial-concept.com - Email: Info < at > arial-concept.com
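
For readers following along, the second symptom in this report looks like the classic case of UTF-8 bytes being shown by something that assumes Latin-1. The short Python sketch below only reproduces that effect on a string; it is an illustration of the likely mechanism, not BackupPC code, and the variable names are made up for the example.

```python
# Illustration only: how a UTF-8 encoded name looks when a viewer assumes Latin-1.
name = "Lumière trouble.jpg"

utf8_bytes = name.encode("utf-8")        # è becomes the two bytes C3 A8
misread = utf8_bytes.decode("latin-1")   # each byte is shown as its own character

print(misread)  # LumiÃ¨re trouble.jpg
```

If the garbled name on screen matches this output, the stored bytes are probably fine and only the display charset is wrong.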




-------------------------------------------------------
This SF.Net email is sponsored by: IBM Linux Tutorials
Free Linux tutorial presented by Daniel Robbins, President and CEO of
GenToo technologies. Learn everything from fundamentals to system
administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
_______________________________________________
BackupPC-users mailing list
BackupPC-users < at > lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/backuppc-users
http://backuppc.sourceforge.net/

Post CharSet coding in files names 
Sam,

I'm having a similar issue with 2.02. Some accented characters are being
represented correctly for the Windows users and under Samba, but
BackupPC converts them to other characters, hence several "File not
found" errors in the logs.

Craig is in Asia this week, so it will probably be a little while before he
can look at it.

In our case, we are not using a foreign character set, so renaming the
files isn't an issue. It seems Word likes to spell Cafe with the accented e,
and when the end user copies/pastes the text as the file name, we get
the problem.

I'm running the new beta on my machine at home. After I get this out,
I'll see if it fixes the problem.

Doug











Post CharSet coding in files names 
It didn't make any difference. I created a test file named café.txt, and
the beta came up saying "file not found". So I guess the new code doesn't
handle this, or something with Perl needs to be added.

Doug










Post CharSet coding in files names 
Hi,

I tested a restore of the file "LumiÃ¨re trouble.jpg" on WinXP and the
restored file is "Lumière trouble.jpg" with the 2.02 version. The
display is wrong but the functionality is right (backup and restore),
so it's just a cosmetic fix to do.

Thanks for your help.

Sam.
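
The restore result suggests the stored name is valid UTF-8 and only the displayed form is misread. As a rough sketch of that idea (not taken from BackupPC; the helper name `undo_mojibake` is invented for this example), one layer of the misreading can be reversed in Python like this:

```python
def undo_mojibake(text: str) -> str:
    """Try to reverse one layer of 'UTF-8 bytes read as Latin-1' garbling.

    Returns the text unchanged when it does not look garbled in that way.
    """
    try:
        return text.encode("latin-1").decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        return text


print(undo_mojibake("LumiÃ¨re trouble.jpg"))  # Lumière trouble.jpg
```

A name that is already correct falls through the `except` branch and comes back as-is, so the helper is safe to apply to a mixed listing.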



--

Sam Przyswa - Chef de projet
Arial Concept - Intégrateur Internet
36, rue de Turin - 75008 - Paris - France
Tel: 01 40 54 86 04 - Fax: 01 40 54 83 01
Web: http://www.arial-concept.com - Email: Info < at > arial-concept.com



Post CharSet coding in files names 
On Tue, 2004-03-23 at 18:21, Doug Lytle wrote:
Didn't make any difference. I created a test file named café.txt,

The beta came up saying file not found. So, guess the new code doesn't
handle this, or something with Perl needs to be added.

Could be a bug in Perl. Are you using 5.8.3?

---
Les Mikesell
les < at > futuresource.com




Post CharSet coding in files names 
5.8.1 under Mandrake 9.2

Doug





Post CharSet coding in files names 
On Tue, 2004-03-23 at 19:48, Doug Lytle wrote:
5.8.1 under Mandrake 9.2

5.8.3 is current and fixes some problems with character set
handling.

---
Les Mikesell
les < at > futuresource.com





Post CharSet coding in files names 
Les Mikesell (les < at > futuresource.com) wrote:

On Tue, 2004-03-23 at 19:48, Doug Lytle wrote:
5.8.1 under Mandrake 9.2

5.8.3 is current and fixes some problems with character set
handling.


We use Perl 5.8.3 with BackupPC 2.0.2.

Sam Przyswa.




Post CharSet coding in files names 
On RH9 I solved the problem with charsets with the following fixes.

Updated Samba to samba-3.0.2a-1_rh9.i386.rpm (available from samba.org).

In /etc/sysconfig/i18n:
LANG="C"
SUPPORTED="en_US:en:fi_FI@euro:fi_FI:fi"
SYSFONT=lat0-16
LC_CTYPE="fi_FI@euro"
LESSCHARSET="latin1"

And added "-O codepage=cp850" to all $Conf{SmbClientFullCmd} lines in
BackupPC's config.pl, like:
$Conf{SmbClientFullCmd} = '$smbClientPath \\\\$host\\$shareName'
. ' $I_option -O codepage=cp850 -U $userName -E -N -d 1'
. ' -c tarmode\\ full -Tc$X_option - $fileList';

This way, at least for me, the Scandinavian letters started working.

timppa
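
One possible reading of why code page 850 matters here, offered as a guess about the mechanism rather than anything confirmed in this thread: in cp850 the letter é is the single byte 0x82, while in Latin-1 it is 0xE9. If a cp850 byte is treated as Latin-1 and then stored as UTF-8, it turns into the two bytes C2 82, which a Latin-1 or Windows-1252 viewer shows as "Â" plus a stray character, much like the garbled "créa.eps" name reported at the start of the thread. The Python sketch below only demonstrates the byte arithmetic; the variable names are made up for the example.

```python
# Assumption: the client sends names in DOS code page 850 and something
# downstream treats those bytes as Latin-1 before storing them as UTF-8.
name = "créa.eps"

cp850_bytes = name.encode("cp850")       # é is the single byte 0x82 in cp850
latin1_bytes = name.encode("latin-1")    # é is the single byte 0xE9 in Latin-1

# Misreading the cp850 byte as Latin-1 yields the control character U+0082,
# which becomes the two bytes C2 82 once it is re-encoded as UTF-8.
reencoded = cp850_bytes.decode("latin-1").encode("utf-8")

print(cp850_bytes.hex(" "))
print(latin1_bytes.hex(" "))
print(reencoded.hex(" "))
```

Telling smbclient which code page the client uses would avoid that first misreading, which may be why the option helped in this setup.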




Post CharSet coding in files names 
Sam Przyswa writes:

I tested a restore of file "LumiÃ¨re trouble.jpg" on WinXP and the
restored file is "Lumière trouble.jpg" with the 2.02 version. The
display is wrong but the functionality is right (backup and restore)
it's just a cosmetic fix to do.

It sounds like you and Doug have different issues.

Just so I understand, Sam, you have file names with special characters
that are not displayed correctly in the CGI interface, but the files
back up and restore correctly. There's probably a $Conf{CgiHeaders}
setting that would fix this. If you view the HTML source, how does
the file name appear? If you look in the backup directory tree,
how does the file name appear? Could you email me off-list a zip
file with a couple of these files (contents don't matter)?

Doug, you have file names with special characters that do not
back up correctly (i.e., you get a "file not found" error in smbclient).
If that's the case, then this issue is specific to smbclient
and should be fixed with an appropriate codeset or unix charset
setting. I would also like some examples of these files;
again, the contents don't matter.

Craig



Post CharSet coding in files names 
"Timppa Airaksinen" writes:

in rh9 i solved the problem with charsets with following fixes

updated samba to samba-3.0.2a-1_rh9.i386.rpm (available from samba.org)

in /etc/sysconfig/i18n
LANG="C"
SUPPORTED="en_US:en:fi_FI@euro:fi_FI:fi"
SYSFONT=lat0-16
LC_CTYPE="fi_FI@euro"
LESSCHARSET="latin1"

and adding "-O codepage=cp850" to all $Conf{SmbClientFullCmd} lines in
backuppc's config.pl like
$Conf{SmbClientFullCmd} = '$smbClientPath \\\\$host\\$shareName'
. ' $I_option -O codepage=cp850 -U $userName -E -N -d 1'
. ' -c tarmode\\ full -Tc$X_option - $fileList';

this way atleast for me the scandinavian letters started working

...and just to confirm, the file names appear correctly in the CGI
interface too?

Craig



Post CharSet coding in files names 
At 11:04 AM 3/24/2004 +0200, Timppa Airaksinen wrote:
in rh9 i solved the problem with charsets with following fixes

updated samba to samba-3.0.2a-1_rh9.i386.rpm (available from samba.org)

in /etc/sysconfig/i18n
LANG="C"
SUPPORTED="en_US:en:fi_FI@euro:fi_FI:fi"
SYSFONT=lat0-16
LC_CTYPE="fi_FI@euro"
LESSCHARSET="latin1"

and adding "-O codepage=cp850" to all $Conf{SmbClientFullCmd} lines in
backuppc's config.pl like
$Conf{SmbClientFullCmd} = '$smbClientPath \\\\$host\\$shareName'
. ' $I_option -O codepage=cp850 -U $userName -E -N -d 1'
. ' -c tarmode\\ full -Tc$X_option - $fileList';

this way atleast for me the scandinavian letters started working

I have Samba 3.0 installed, using utf-8, and Windows clients can create
files with mixed Korean/Japanese characters in the same file name on this
server. It works well.

I have looked through the Samba documentation, and cannot find
documentation on the "-O codepage=xxxx" option. If someone can provide a
pointer to what this option does, I would appreciate it.

Thank you.


Marlin Prowell
Cadalog, Inc.





Post CharSet coding in files names 
Hi Marlin,

I have Samba 3.0 installed, using utf-8, and Windows clients can create
files with mixed Korean/Japanese characters in the same file name on this
server. It works well.

After I installed Samba 3.0, I tried using UTF-8.

It almost worked well with BackupPC. But when I restore a file whose name
uses a Japanese charset, I find weird characters in the filename.
UTF-8 is a good solution, but it seems Microsoft IE can accept only shift_jis
in the filename when downloading.

If you can try that, let me know how the restore goes.

Koichi
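
To make the download problem concrete, here is a small Python sketch of how the same Japanese file name differs in UTF-8 and Shift_JIS, and what a client expecting Shift_JIS would make of UTF-8 bytes. It assumes, as described above, that the browser reads the download file name as Shift_JIS; the sample name is invented for the illustration.

```python
# Hypothetical sample name; any Japanese file name behaves the same way.
name = "日本語.txt"

utf8_bytes = name.encode("utf-8")        # three bytes per kanji
sjis_bytes = name.encode("shift_jis")    # two bytes per kanji

# What a client that assumes Shift_JIS would display if handed the UTF-8 bytes.
shown = utf8_bytes.decode("shift_jis", errors="replace")

print(len(utf8_bytes), len(sjis_bytes))
print(shown == name)  # False
```

Sending the Shift_JIS bytes instead would round-trip cleanly for such a client, at the cost of not covering names that mix in characters from other languages.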





Post CharSet coding in files names 
in rh9 i solved the problem with charsets with following fixes

updated samba to samba-3.0.2a-1_rh9.i386.rpm (available from samba.org)

in /etc/sysconfig/i18n
LANG="C"
SUPPORTED="en_US:en:fi_FI@euro:fi_FI:fi"
SYSFONT=lat0-16
LC_CTYPE="fi_FI@euro"
LESSCHARSET="latin1"

and adding "-O codepage=cp850" to all $Conf{SmbClientFullCmd} lines in
backuppc's config.pl like
$Conf{SmbClientFullCmd} = '$smbClientPath \\\\$host\\$shareName'
. ' $I_option -O codepage=cp850 -U $userName -E -N -d 1'
. ' -c tarmode\\ full -Tc$X_option - $fileList';

this way atleast for me the scandinavian letters started working

...and just to confirm, the file names appear correctly in
the CGI interface too?

Craig

Yes, they do. I've no idea why I had to include it there too (the -O in
$Conf{SmbClientFullCmd}), because I think it should be the same as if it were
written in smb.conf, which didn't work. But anyway, this way it was solved,
at least for me.

timppa




Post CharSet coding in files names 
At 11:22 PM 3/27/2004 +0900, Koichi Kubo wrote:
I have Samba 3.0 installed, using utf-8, and Windows clients can create
files with mixed Korean/Japanese characters in the same file name on this
server. It works well.

After I installed Samba 3.0, I tried using UTF-8.

It almost worked well with BackupPC. But when I restore a file whose name
uses a Japanese charset, I find weird characters in the filename.
UTF-8 is a good solution, but it seems Microsoft IE can accept only shift_jis
in the filename when downloading.

If you can try that, let me know how the restore goes.

Well, in just two sentences, I seem to have confused five entirely separate
issues. That might be a new record.

I have a Samba 3.0 server running on FreeBSD. Japanese or Korean Windows
machines can create mixed language files on the server. The files exist on
FreeBSD, and are visible to Windows.

As an aside, since I have Asian language support on my English machine, I
can also see those files correctly. These are the only combinations that I
can confirm to work.

I don't know about the following:

Can smbclient, running on *nix, correctly see multi-byte and/or utf-8 files
on a Windows box? This is the reverse of the above situation. I don't know.

Note that neither of these points involves BackupPC.

Here are some more points to be tested:

Can BackupPC use smbclient and reliably fetch mixed language utf-8 files
from a Windows box?

Can BackupPC use smbclient to restore mixed language utf-8 files to a
Windows box?

Can BackupPC reliably back up and restore the UTF-8 files that are on the
Samba server (created in my original example) by using the native *nix file
system? That is, can BackupPC back up a Unicode-enabled Samba server?

So, as to your original question, I actually have not tried restores of
Japanese-named files with BackupPC. I would not be surprised if there were
problems.

And you have an important point about filename downloading. Although IE
can display utf-8 just fine, when it is time to download a file, it may
well require the file information be sent in shift-jis. So that may be
another client-level configuration option in BackupPC.

This is similar to the Japanese email issue. I would like to use utf-8 for
sending email, but too many email clients cannot accept it. So, although
the web sites are entirely in utf-8, I convert all the email strings to
iso-2022-jp right before sending an email.
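
The conversion step described in the last paragraph might look roughly like the Python sketch below. It is only an illustration of re-encoding UTF-8 text as ISO-2022-JP before sending; the helper name `to_mail_bytes` and the sample subject are invented, and real mail code would also need to set matching headers.

```python
def to_mail_bytes(text: str) -> bytes:
    """Encode text as ISO-2022-JP for mail clients that cannot read UTF-8.

    Raises UnicodeEncodeError if the text contains characters that
    ISO-2022-JP cannot represent (for example Korean hangul).
    """
    return text.encode("iso2022_jp")


# Text as it would come from a UTF-8 web site.
stored = "バックアップ完了".encode("utf-8")
subject = stored.decode("utf-8")

wire = to_mail_bytes(subject)
print(wire)
```

ISO-2022-JP is a 7-bit encoding that switches character sets with escape sequences, which is one reason older mail clients handle it more reliably than UTF-8. The strict error handling is deliberate here: a mixed Japanese/Korean string cannot be sent this way, so it seems better to fail loudly than to drop characters silently.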


Marlin Prowell
Cadalog, Inc.




