readCommFile: ERR - timed out after 900 seconds while reading from - Veritas Net Backup

This is a discussion on readCommFile: ERR - timed out after 900 seconds while reading from - Veritas Net Backup ; Hi I have a problem with 1 client. There use Oracle 10g on Solaris 9 with NBU5.1 MP3. Backup function with 1 channel but by more then 2 channels the Backup crash on clientsite with this messages; 08:42:42.573 [18412] int_WriteData: ...

+ Reply to Thread
Results 1 to 4 of 4

Thread: readCommFile: ERR - timed out after 900 seconds while reading from

  1. readCommFile: ERR - timed out after 900 seconds while reading from


    Hi
    I have a problem with 1 client. There use Oracle 10g on Solaris 9 with NBU5.1
    MP3.
    Backup function with 1 channel but by more then 2 channels the Backup crash
    on clientsite with this messages;

    08:42:42.573 [18412] <2> int_WriteData: INF - writing buffer # 1 of size
    262144
    08:42:42.591 [18412] <2> int_CloseImage: INF - Backup - closing
    08:42:57.590 [18412] <2> int_GetBfsDateRange: INF - Using default date range
    08:42:57.590 [18412] <2> int_GetBfsDateRange: INF - Start Time = 12/26/95
    01:00:00
    08:42:57.590 [18412] <2> int_GetBfsDateRange: INF - End Time = 07/20/05 08:42:57
    08:42:57.607 [18412] <2> logconnections: BPRD CONNECT FROM 138.191.79.106.39991
    TO 138.191.1.28.13720
    08:52:32.187 [18411] <16> readCommFile: ERR - timed out after 900 seconds
    while reading from /usr/openv/netbackup/logs/user_ops/dbext/logs/18411.0.1121754968
    08:52:32.187 [18411] <4> close_image: INF - backup FAILED
    08:52:32.187 [18411] <4> close_image: INF ---- end of Backup ---

    08:52:32.187 [18411] <16> VxBSAEndTxn: ERR - bsa_close() failed.
    08:52:32.187 [18411] <16> xbsa_EndTransaction: ERR - VxBSAEndTxn: Failed
    with error:
    The transaction was aborted.
    in the bp.conf the Client_READ_Timeout = 3600 also on mediaserver

    who can help me?

  2. Re: readCommFile: ERR - timed out after 900 seconds while reading from


    "Tom Egger" wrote:
    >
    >Hi
    >I have a problem with 1 client. There use Oracle 10g on Solaris 9 with

    NBU5.1
    >MP3.
    >Backup function with 1 channel but by more then 2 channels the Backup crash
    >on clientsite with this messages;
    >
    >08:42:42.573 [18412] <2> int_WriteData: INF - writing buffer # 1 of size
    >262144
    >08:42:42.591 [18412] <2> int_CloseImage: INF - Backup - closing
    >08:42:57.590 [18412] <2> int_GetBfsDateRange: INF - Using default date range
    >08:42:57.590 [18412] <2> int_GetBfsDateRange: INF - Start Time = 12/26/95
    >01:00:00
    >08:42:57.590 [18412] <2> int_GetBfsDateRange: INF - End Time = 07/20/05

    08:42:57
    >08:42:57.607 [18412] <2> logconnections: BPRD CONNECT FROM 138.191.79.106.39991
    >TO 138.191.1.28.13720
    >08:52:32.187 [18411] <16> readCommFile: ERR - timed out after 900 seconds
    >while reading from /usr/openv/netbackup/logs/user_ops/dbext/logs/18411.0.1121754968
    >08:52:32.187 [18411] <4> close_image: INF - backup FAILED
    >08:52:32.187 [18411] <4> close_image: INF ---- end of Backup ---
    >
    >08:52:32.187 [18411] <16> VxBSAEndTxn: ERR - bsa_close() failed.
    >08:52:32.187 [18411] <16> xbsa_EndTransaction: ERR - VxBSAEndTxn: Failed
    >with error:
    > The transaction was aborted.
    >in the bp.conf the Client_READ_Timeout = 3600 also on mediaserver
    >
    >who can help me?



    This is what i've had set in bp.conf on my Master and ALL Clients, since
    the days of NBU 3.1.1 -

    #
    # rrb - make sure the next 3 timeout params are identical..
    # - bpend_timeout, bpstart_timeout & client_read_timeout...
    BPEND_TIMEOUT = 99999
    BPSTART_TIMEOUT = 99999
    CLIENT_READ_TIMEOUT = 99999
    #



  3. Re: readCommFile: ERR - timed out after 900 seconds while reading from


    "Creaky" wrote:
    >
    >"Tom Egger" wrote:
    >>
    >>Hi
    >>I have a problem with 1 client. There use Oracle 10g on Solaris 9 with

    >NBU5.1
    >>MP3.
    >>Backup function with 1 channel but by more then 2 channels the Backup crash
    >>on clientsite with this messages;
    >>
    >>08:42:42.573 [18412] <2> int_WriteData: INF - writing buffer # 1 of size
    >>262144
    >>08:42:42.591 [18412] <2> int_CloseImage: INF - Backup - closing
    >>08:42:57.590 [18412] <2> int_GetBfsDateRange: INF - Using default date

    range
    >>08:42:57.590 [18412] <2> int_GetBfsDateRange: INF - Start Time = 12/26/95
    >>01:00:00
    >>08:42:57.590 [18412] <2> int_GetBfsDateRange: INF - End Time = 07/20/05

    >08:42:57
    >>08:42:57.607 [18412] <2> logconnections: BPRD CONNECT FROM 138.191.79.106.39991
    >>TO 138.191.1.28.13720
    >>08:52:32.187 [18411] <16> readCommFile: ERR - timed out after 900 seconds
    >>while reading from /usr/openv/netbackup/logs/user_ops/dbext/logs/18411.0.1121754968
    >>08:52:32.187 [18411] <4> close_image: INF - backup FAILED
    >>08:52:32.187 [18411] <4> close_image: INF ---- end of Backup ---
    >>
    >>08:52:32.187 [18411] <16> VxBSAEndTxn: ERR - bsa_close() failed.
    >>08:52:32.187 [18411] <16> xbsa_EndTransaction: ERR - VxBSAEndTxn: Failed
    >>with error:
    >> The transaction was aborted.
    >>in the bp.conf the Client_READ_Timeout = 3600 also on mediaserver
    >>
    >>who can help me?

    >
    >
    >This is what i've had set in bp.conf on my Master and ALL Clients, since
    >the days of NBU 3.1.1 -
    >
    >#
    ># rrb - make sure the next 3 timeout params are identical..
    ># - bpend_timeout, bpstart_timeout & client_read_timeout...
    >BPEND_TIMEOUT = 99999
    >BPSTART_TIMEOUT = 99999
    >CLIENT_READ_TIMEOUT = 99999
    >#
    >
    >



    i've just had a similar problem on over the weekend. i googled for the error
    and all i found was this thread where i'd even posted myself back in July


    from a logfile in dbclient dir -

    11:05:24.387 [26739] <8> connectSock: WRN - connect() to server failed,
    Connection refused (146)
    11:05:34.384 [26739] <16> readCommFile: ERR - timed out after 5 seconds while
    reading from /usr/openv/netbackup/logs/user_ops/dbext/logs/26739.0.1129457054
    11:05:34.384 [26739] <16> serverResponse: ERR - cannot connect on new DATA
    socket bkup-server.4889
    11:05:34.385 [26739] <16> CreateNewImage: ERR - serverResponse() failed
    11:05:34.386 [26739] <16> VxBSACreateObject: ERR - Could not create new image
    with file /38h1b0l7_1_1.
    11:05:34.386 [26739] <16> xbsa_CreateObject: ERR - VxBSACreateObject: Failed
    with error:
    Server Status: Communication with the server has not been iniatated or
    the server status has not been retrieved from the server.
    11:05:43.845 [26739] <4> sbtend: INF - --- END of SESSION ---
    11:05:43.845 [26739] <8> close_image: Session being terminated abnormally,
    cleaning up
    11:05:43.846 [26739] <4> close_image: INF - backup FAILED
    11:05:43.846 [26739] <4> close_image: INF ---- end of Backup ---
    11:05:43.846 [26739] <16> VxBSAEndTxn: ERR - Transaction ended with active
    Backup/Restore.
    11:05:43.846 [26739] <16> xbsa_EndTransaction: ERR - VxBSAEndTxn: Failed
    with error:
    The transaction was aborted.
    11:05:53.726 [26866] <4> VxBSAInit: VERITAS NetBackup XBSA Interface - 5.1
    2004043016

    Anyone got any ideas ?

  4. Re: readCommFile: ERR - timed out after 900 seconds while reading from


    "Creaky" wrote:
    >
    >"Creaky" wrote:
    >>
    >>"Tom Egger" wrote:
    >>>
    >>>Hi
    >>>I have a problem with 1 client. There use Oracle 10g on Solaris 9 with

    >>NBU5.1
    >>>MP3.
    >>>Backup function with 1 channel but by more then 2 channels the Backup

    crash
    >>>on clientsite with this messages;
    >>>
    >>>08:42:42.573 [18412] <2> int_WriteData: INF - writing buffer # 1 of size
    >>>262144
    >>>08:42:42.591 [18412] <2> int_CloseImage: INF - Backup - closing
    >>>08:42:57.590 [18412] <2> int_GetBfsDateRange: INF - Using default date

    >range
    >>>08:42:57.590 [18412] <2> int_GetBfsDateRange: INF - Start Time = 12/26/95
    >>>01:00:00
    >>>08:42:57.590 [18412] <2> int_GetBfsDateRange: INF - End Time = 07/20/05

    >>08:42:57
    >>>08:42:57.607 [18412] <2> logconnections: BPRD CONNECT FROM 138.191.79.106.39991
    >>>TO 138.191.1.28.13720
    >>>08:52:32.187 [18411] <16> readCommFile: ERR - timed out after 900 seconds
    >>>while reading from /usr/openv/netbackup/logs/user_ops/dbext/logs/18411.0.1121754968
    >>>08:52:32.187 [18411] <4> close_image: INF - backup FAILED
    >>>08:52:32.187 [18411] <4> close_image: INF ---- end of Backup ---
    >>>
    >>>08:52:32.187 [18411] <16> VxBSAEndTxn: ERR - bsa_close() failed.
    >>>08:52:32.187 [18411] <16> xbsa_EndTransaction: ERR - VxBSAEndTxn: Failed
    >>>with error:
    >>> The transaction was aborted.
    >>>in the bp.conf the Client_READ_Timeout = 3600 also on mediaserver
    >>>
    >>>who can help me?

    >>
    >>
    >>This is what i've had set in bp.conf on my Master and ALL Clients, since
    >>the days of NBU 3.1.1 -
    >>
    >>#
    >># rrb - make sure the next 3 timeout params are identical..
    >># - bpend_timeout, bpstart_timeout & client_read_timeout...
    >>BPEND_TIMEOUT = 99999
    >>BPSTART_TIMEOUT = 99999
    >>CLIENT_READ_TIMEOUT = 99999
    >>#
    >>
    >>

    >
    >
    >i've just had a similar problem on over the weekend. i googled for the error
    >and all i found was this thread where i'd even posted myself back in July
    >
    >
    >from a logfile in dbclient dir -
    >
    >11:05:24.387 [26739] <8> connectSock: WRN - connect() to server failed,


    >Connection refused (146)
    >11:05:34.384 [26739] <16> readCommFile: ERR - timed out after 5 seconds

    while
    >reading from /usr/openv/netbackup/logs/user_ops/dbext/logs/26739.0.1129457054
    >11:05:34.384 [26739] <16> serverResponse: ERR - cannot connect on new DATA
    >socket bkup-server.4889
    >11:05:34.385 [26739] <16> CreateNewImage: ERR - serverResponse() failed
    >11:05:34.386 [26739] <16> VxBSACreateObject: ERR - Could not create new

    image
    >with file /38h1b0l7_1_1.
    >11:05:34.386 [26739] <16> xbsa_CreateObject: ERR - VxBSACreateObject: Failed
    >with error:
    > Server Status: Communication with the server has not been iniatated

    or
    >the server status has not been retrieved from the server.
    >11:05:43.845 [26739] <4> sbtend: INF - --- END of SESSION ---
    >11:05:43.845 [26739] <8> close_image: Session being terminated abnormally,
    >cleaning up
    >11:05:43.846 [26739] <4> close_image: INF - backup FAILED
    >11:05:43.846 [26739] <4> close_image: INF ---- end of Backup ---
    >11:05:43.846 [26739] <16> VxBSAEndTxn: ERR - Transaction ended with active
    >Backup/Restore.
    >11:05:43.846 [26739] <16> xbsa_EndTransaction: ERR - VxBSAEndTxn: Failed
    >with error:
    > The transaction was aborted.
    >11:05:53.726 [26866] <4> VxBSAInit: VERITAS NetBackup XBSA Interface - 5.1
    > 2004043016
    >
    >Anyone got any ideas ?



    - ignore this one, looked at the error some more and it looks like a client
    connect issue. the 'client_connect_timeout' param is still at it's default
    5mins, and in 5yrs of using NBU i haven't once had to change that value.
    This server exhibited (as yet) unexplained behaviour over the weekend so
    i'm gonna lay the blame at Oracle and not NBU

+ Reply to Thread