| Evaluating Methods to Rediscover Missing Web Pages from the Web Infrastructure. (arXiv:0907.2268v1 [ |
[Jul. 15th, 2009|07:44 am] |
Evaluating Methods to Rediscover Missing Web Pages from the Web Infrastructure. (arXiv:0907.2268v1 [cs.IR])
Missing web pages (pages that return the 404 "Page Not Found" error) are part
of the browsing experience. The manual use of search engines to rediscover
missing pages can be frustrating and unsuccessful. We compare four automated
methods for rediscovering web pages. We extract the page's title, generate the
page's lexical signature (LS), query the bookmarking website delicious.com for
the page's tags, and generate an LS from the page's link neighborhood. We use
all methods to query Internet search engines and analyze their retrieval
performance. Our results show that both LSs and titles perform fairly well, with
over 60% of URLs returned top ranked from Yahoo. However, the combination of
methods improves the retrieval performance. Considering the complexity of the
LS generation, querying the title first and, in case of insufficient results,
querying the LSs second is the preferable setup. This combination accounts for
more than 75% of URLs being returned top ranked.
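
The title-first, LS-second setup described above can be sketched roughly as follows. This is only an illustration, not the authors' implementation: the `search` and `is_match` callables, the `top_k` cutoff, and the toy index are all assumptions made up for the example, and "insufficient results" is interpreted here as "no matching URL among the first `top_k` hits".

```python
from typing import Callable, List, Optional


def rediscover(
    title: str,
    lexical_signature: str,
    search: Callable[[str], List[str]],
    is_match: Callable[[str], bool],
    top_k: int = 10,
) -> Optional[str]:
    """Return the first matching URL, trying the title before the LS."""
    # The cheap query (title) goes first; the LS is only used as a fallback.
    for query in (title, lexical_signature):
        for url in search(query)[:top_k]:
            if is_match(url):
                return url
    return None


# Hypothetical stand-in for a search engine: a fixed query -> results table.
FAKE_INDEX = {
    "annual report 2008": ["http://example.org/other"],
    "report fiscal audit 2008 board": ["http://example.org/new/report"],
}

found = rediscover(
    "annual report 2008",
    "report fiscal audit 2008 board",
    search=lambda q: FAKE_INDEX.get(q, []),
    is_match=lambda url: url.endswith("/report"),
)
print(found)
```

In this toy run the title query returns no acceptable hit, so the function falls through to the lexical signature query.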
read more at cs updates on arXiv.org |
|
|
| Hard Fault Analysis of Trivium. (arXiv:0907.2315v1 [cs.CR]) |
[Jul. 15th, 2009|07:44 am] |
Hard Fault Analysis of Trivium. (arXiv:0907.2315v1 [cs.CR])
Fault analysis is a powerful attack on stream ciphers. Up to now, the major
idea of fault analysis has been to simplify the cipher system by injecting some
soft faults. We call it soft fault analysis. As a hardware-oriented stream
cipher, Trivium is weak under soft fault analysis.
In this paper we consider another type of fault analysis of stream ciphers,
which is to simplify the cipher system by injecting some hard faults. We call
it hard fault analysis. We present the following results about such attacks on
Trivium. In Case 1, with probability not smaller than 0.2396, the attacker
can obtain 69 bits of the 80-bit key. In Case 2, with probability not smaller
than 0.2291, the attacker can obtain all of the 80-bit key. In Case 3, with
probability not smaller than 0.2291, the attacker can partially solve the key.
In Case 4, with non-negligible probability, the attacker can obtain a
simplified cipher, with a smaller number of state bits and a slower
non-linearization procedure. In Case 5, with non-negligible probability, the
attacker can obtain another simplified cipher. Besides, these 5 cases can be
checked by observing the key-stream.
|
|
| Separations of non-monotonic randomness notions. (arXiv:0907.2324v1 [cs.CC]) |
[Jul. 15th, 2009|07:44 am] |
Separations of non-monotonic randomness notions. (arXiv:0907.2324v1 [cs.CC])
In the theory of algorithmic randomness, several notions of random sequence
are defined via a game-theoretic approach, and the notions that have received
the most attention are perhaps Martin-Löf randomness and computable randomness.
The latter notion was introduced by Schnorr and is rather natural: an infinite
binary sequence is computably random if no total computable strategy succeeds
on it by betting on bits in order. However, computably random sequences can
have properties that one may consider to be incompatible with being random; in
particular, there are computably random sequences that are highly compressible.
The concept of Martin-Löf randomness is much better behaved in this and other
respects; on the other hand, its definition in terms of martingales is
considerably less natural. Muchnik, elaborating on ideas of Kolmogorov and
Loveland, refined Schnorr's model by also allowing non-monotonic strategies,
i.e., strategies that do not bet on bits in order. The resulting
"non-monotonic" notion of randomness, now called Kolmogorov-Loveland
randomness, has been shown to be quite close to Martin-Löf randomness, but
whether these two classes coincide remains a fundamental open question. As
suggested by Miller and Nies, we study in this paper weak versions of
Kolmogorov-Loveland randomness, where the betting strategies are non-adaptive
(i.e., the positions of the bits to bet on should be decided before the game).
We obtain a full classification of the different notions we consider.
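
To make the betting game concrete, here is a minimal sketch of a non-adaptive strategy on a finite prefix, assuming the usual fair-payoff convention (a correct guess wins the stake, a wrong guess loses it). The function names and the example strategy are illustrative assumptions, not taken from the paper, and a finite simulation says nothing about computability or about success on infinite sequences.

```python
from fractions import Fraction


def run_nonadaptive_strategy(bits, positions, bet):
    """Play a betting game where the order of positions is fixed in advance.

    bits      -- the (finite prefix of the) sequence being bet on
    positions -- positions to bet on, decided before the game starts
    bet       -- function (seen, pos) -> (guess, fraction of capital staked)
    """
    capital = Fraction(1)
    seen = []  # (position, revealed bit) pairs observed so far
    for pos in positions:
        guess, fraction = bet(seen, pos)
        stake = capital * fraction
        # Fair payoff: the stake is won on a correct guess and lost otherwise.
        capital += stake if bits[pos] == guess else -stake
        seen.append((pos, bits[pos]))
    return capital


def always_guess_zero(seen, pos):
    # Hypothetical strategy: stake half the capital on the bit being 0.
    return 0, Fraction(1, 2)


sequence = [0, 1, 0, 1]
in_order = run_nonadaptive_strategy(sequence, range(4), always_guess_zero)
even_only = run_nonadaptive_strategy(sequence, [0, 2], always_guess_zero)
print(in_order, even_only)
```

Betting in order (`positions = range(n)`) corresponds to the monotonic setting of computable randomness; choosing a different fixed order, such as the even positions only, is the kind of freedom a non-adaptive non-monotonic strategy has.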
|
|
| Optimal Diversity-Multiplexing Tradeoff in Selective-Fading MIMO Channels. (arXiv:0907.2391v1 [cs.IT |
[Jul. 15th, 2009|07:44 am] |
Optimal Diversity-Multiplexing Tradeoff in Selective-Fading MIMO Channels. (arXiv:0907.2391v1 [cs.IT])
We establish the optimal diversity-multiplexing (DM) tradeoff of coherent
time, frequency, and time-frequency selective-fading multiple-input
multiple-output (MIMO) channels and provide a code design criterion for DM
tradeoff optimality. Our results are based on the new concept of the "Jensen
channel" associated to a given selective-fading MIMO channel. While the
original problem seems analytically intractable due to the mutual information
between channel input and output being a sum of correlated random variables,
the Jensen channel is equivalent to the original channel in the sense of the DM
tradeoff and lends itself nicely to analytical treatment. We formulate a
systematic procedure for designing DM tradeoff optimal codes for general
selective-fading MIMO channels by demonstrating that the design problem can be
separated into two simpler and independent problems: the design of an inner
code, or precoder, adapted to the channel statistics (i.e., the selectivity
characteristics) and an outer code independent of the channel statistics. Our
results are supported by appealing geometric intuition, first pointed out for
the flat-fading case by Zheng and Tse, IEEE Trans. Inf. Theory, 2003.
|
|
| Design of Pulse Shapes and Digital Filters Based on Gaussian Functions. (arXiv:0907.2412v1 [cs.IT]) |
[Jul. 15th, 2009|07:44 am] |
Design of Pulse Shapes and Digital Filters Based on Gaussian Functions. (arXiv:0907.2412v1 [cs.IT])
Two new pulse shapes for communications are presented. The first pulse shape
is ISI-free and identical with the interpolating function (or ISI-free kernel)
of a reconstruction formula in shift-invariant spaces with Gaussian generator.
Several closed form representations in time and frequency domain are given
including one for an approximation that is particularly simple. The second
pulse shape is the root of the former and obtained by spectral factorization.
As a consequence, shifted versions of it form an orthonormal system in the
Hilbert space of finite-energy signals. The latter pulse shape is described as
the response of an infinite-order digital FIR filter to a Gaussian function as
input signal. Several equivalent versions of the digital filter including their
finite-order approximations are presented. All filters enjoy the property that
explicit formulas for their coefficients and poles are available. The filters
are fully parametrizable with respect to bandwidth and sampling rate of the
digital data.
|
|
| Computing Multidimensional Persistence. (arXiv:0907.2423v1 [cs.CG]) |
[Jul. 15th, 2009|07:44 am] |
Computing Multidimensional Persistence. (arXiv:0907.2423v1 [cs.CG])
The theory of multidimensional persistence captures the topology of a
multifiltration -- a multiparameter family of increasing spaces.
Multifiltrations arise naturally in the topological analysis of scientific
data. In this paper, we give a polynomial time algorithm for computing
multidimensional persistence. We recast this computation as a problem within
computational algebraic geometry and utilize algorithms from this area to solve
it. While the resulting problem is EXPSPACE-complete and the standard
algorithms take doubly-exponential time, we exploit the structure inherent
within multifiltrations to yield practical algorithms. We implement all
algorithms in the paper and provide statistical experiments to demonstrate
their feasibility.
|
|
| AWiMA: An architecture for Adhoc Wireless Mobile internet Access. (arXiv:0907.2252v1 [cs.NI]) |
[Jul. 15th, 2009|07:44 am] |
AWiMA: An architecture for Adhoc Wireless Mobile internet Access. (arXiv:0907.2252v1 [cs.NI])
This paper suggests a system architecture for wireless wide-area-networking
access using adhoc networking between a mobile Client node without direct
connectivity to a wireless wide-area network and a mobile Service Provider node
with connectivity to a wireless wide-area network. It provides a means for
securely providing such adhoc wireless networking services using a Server for
tunneling and routing, registration and authentication. The architecture also
provides support for handoff of a Client node from one Service Provider to
another with persistence of a tunnel between the Client and the Server enabling
a soft-handoff. Different wireless protocols may be used for adhoc networking,
with filtered interconnection of authenticated Clients implemented at a Service
Provider node. The architecture is applicable across different wide-area-network
protocols, and provides simultaneous support for multiple wide-area-network
protocols.
|
|
| Sequential Posted Pricing and Multi-parameter Mechanism Design. (arXiv:0907.2435v1 [cs.GT]) |
[Jul. 15th, 2009|07:44 am] |
Sequential Posted Pricing and Multi-parameter Mechanism Design. (arXiv:0907.2435v1 [cs.GT])
We consider the classical mathematical economics problem of Bayesian
optimal mechanism design, where a principal aims to optimize a given objective
when allocating resources to self-interested agents. We show that for general
product distributions on agent preferences and resource allocation problems
that satisfy matroid properties (e.g., multi-unit auctions, matchings, spanning
trees), sequential posted price mechanisms, where agents are approached in turn
and offered a pre-computed take-it-or-leave-it offer, are at most a
4-approximation to the optimal single-round mechanism. Notably, the analysis of
this sequential posted price mechanism can be extended to give approximation
mechanisms for the unsolved multi-parameter setting. In stark contrast to the
single-parameter setting, in multi-parameter settings there is no general
description or tractable implementation of optimal mechanisms. For decades,
this unanswered issue has been widely considered one of the most important in
the economic theory on mechanism design. We focus on the unit-demand special
case where each agent has a different value for each resource or service but
desires at most one. Our second result is that for general product
distributions on unit-demand preferences and matroid problems, an
easy-to-compute VCG-type mechanism is an 8-approximation to the optimal
(deterministic) mechanism. Our exposition focuses on the objective of profit
maximization; however, the approaches generalize to other objectives that are
linear in valuations and payments.
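
The mechanics of a sequential posted price mechanism can be illustrated with a small sketch for the multi-unit case (the simplest matroid constraint: at most `units` agents may be served). The prices below are made-up numbers; the paper's price pre-computation and its approximation analysis are not reproduced here, and all names are illustrative assumptions.

```python
def sequential_posted_pricing(values, prices, units):
    """Approach agents in order, offering each a take-it-or-leave-it price.

    values -- each agent's private value for one unit
    prices -- the pre-computed price offered to each agent (hypothetical here)
    units  -- number of identical units available
    """
    winners = []
    revenue = 0.0
    for agent, (value, price) in enumerate(zip(values, prices)):
        if len(winners) == units:
            break  # feasibility constraint: no units left to sell
        if value >= price:
            # A self-interested agent accepts exactly when the offer
            # does not exceed its value.
            winners.append(agent)
            revenue += price
    return winners, revenue


winners, revenue = sequential_posted_pricing(
    values=[5, 1, 7, 9], prices=[4, 4, 6, 6], units=2
)
print(winners, revenue)
```

Note that the last agent, who has the highest value, is never approached once both units are sold; this loss relative to the optimal mechanism is what an approximation guarantee has to bound.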
|
|
| An Energy-Based Comparison of Long-Hop and Short-Hop Routing in MIMO Networks. (arXiv:0808.0037v4 [c |
[Jul. 15th, 2009|07:44 am] |
An Energy-Based Comparison of Long-Hop and Short-Hop Routing in MIMO Networks. (arXiv:0808.0037v4 [cs.IT] UPDATED)
This paper considers the problem of selecting either routes that consist of
long hops or routes that consist of short hops in a network of multiple-antenna
nodes, where each transmitting node employs spatial multiplexing. This
distance-dependent route selection problem is approached from the viewpoint of
energy efficiency, where a route is selected with the objective of minimizing
the transmission energy consumed while satisfying a target outage criterion at
the final destination. Deterministic line networks and two-dimensional random
networks are considered. It is shown that when 1) the number of hops traversed
between the source and destination grows large, 2) the target success
probability approaches one, or 3) the number of transmit and/or receive
antennas grows large, short-hop routing requires less energy than long-hop
routing. It is also shown that if both routing strategies are subject to the
same delay constraint, long-hop routing requires less energy than short-hop
routing as the target success probability approaches one. In addition,
numerical analysis indicates that given loose outage constraints, only a small
number of transmit antennas are needed for short-hop routing to have its
maximum advantage over long-hop routing, while given stringent outage
constraints, the advantage of short-hop over long-hop routing always increases
with additional transmit antennas.
|
|
| A Duality View of Boosting Algorithms. (arXiv:0901.3590v2 [cs.LG] UPDATED) |
[Jul. 15th, 2009|07:43 am] |
A Duality View of Boosting Algorithms. (arXiv:0901.3590v2 [cs.LG] UPDATED)
We study boosting algorithms from a new perspective. We show that the
Lagrange dual problems of AdaBoost, LogitBoost and soft-margin LPBoost with
generalized hinge loss are all entropy maximization problems. By looking at the
dual problems of these boosting algorithms, we show that the success of
boosting algorithms can be understood in terms of maintaining a better margin
distribution by maximizing margins and at the same time controlling the margin
variance. We also theoretically prove that, approximately, AdaBoost maximizes
the average margin, instead of the minimum margin. The duality formulation also
enables us to develop column generation based optimization algorithms, which
are totally corrective. We show that they exhibit almost identical
classification results to those of standard stage-wise additive boosting
algorithms but with much faster convergence rates. Therefore fewer weak
classifiers are needed to build the ensemble using our proposed optimization
technique.
|
|
| Termination of the Sequence of SDS Sets and Machine Decision for Positive Semi-definite Forms. (arXi |
[Jul. 15th, 2009|07:43 am] |
Termination of the Sequence of SDS Sets and Machine Decision for Positive Semi-definite Forms. (arXiv:0904.4030v2 [cs.SC] UPDATED)
Employing the concept of termination of the sequence of SDS sets to describe
the positive semi-definite property of a form, we establish a necessary and
sufficient condition for deciding whether a given form on $\mathcal{R}^n_+$ is
positive semi-definite, and show that, for a form which is (strictly)
positive definite on $\mathcal{R}^n_+$, the corresponding sequence of SDS sets
is positively terminating.
The above results are obtained as follows. We first define the column
stochastic mean matrix, and then prove that if we choose countably infinitely
many matrices at random (repeats allowed) from a finite set of $n\times n$
column stochastic mean matrices, then the product of these matrices converges
to a column stochastic mean matrix of rank 1. Finally, we prove the
relations between termination of the sequence of SDS sets and the positive
semi-definite property of a form.
The Maple program TSDS3, based upon these results, not only automatically
proves polynomial inequalities, but also outputs counterexamples for those that
are false. This method is verified to be very efficient and better than the
Pólya method.
|
|
| Bits Through Relay Cascades with Half-Duplex Constraint. (arXiv:0906.1599v2 [cs.IT] UPDATED) |
[Jul. 15th, 2009|07:43 am] |
Bits Through Relay Cascades with Half-Duplex Constraint. (arXiv:0906.1599v2 [cs.IT] UPDATED)
Consider a relay cascade, i.e. a network where the source node, the sink node
and a certain number of intermediate relay nodes are arranged in a line. We
assume that adjacent node pairs are connected by error-free (q+1)-ary pipes.
The following communication scenario is treated. The source and a subset of the
relays wish to communicate independent information to a common sink under the
condition that each relay in the cascade is half-duplex constrained. We
introduce a simple channel model for half-duplex constrained links and provide
a coding scheme which transfers information by an information-dependent,
non-deterministic allocation of the transmission and reception slots of the
relays. The coding scheme requires synchronization on the symbol level through
a shared clock. In the case of a relay cascade with a single source, the coding
strategy is capacity achieving. Numerical values for the capacity of cascades
of various lengths are provided, and it turns out that the capacities are
significantly higher than the rates which are achievable with a deterministic
time-sharing approach. If the cascade includes a source and a certain number of
relays with their own information, the strategy achieves the cut-set bound when
the rates of the relay sources fall below individual thresholds. Hence, a
partial characterization of the boundary of the capacity region follows. For
cascades composed of an infinite number of half-duplex constrained relays and a
single source, we derive an explicit capacity expression. Remarkably, the
capacity for q=1 is equal to the logarithm of the golden ratio. We finally show
that the proposed coding strategy is superior to network coding in the case of
the wireless, half-duplex constrained butterfly network.
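
As a quick numerical check of the q=1 statement above, the value can be computed directly. The abstract does not state the base of the logarithm; base 2 (bits per channel use) is an assumption here, and the function name is made up for the example.

```python
import math


def golden_ratio_capacity(base: float = 2.0) -> float:
    """Logarithm of the golden ratio in the given base (base 2 is assumed)."""
    golden_ratio = (1 + math.sqrt(5)) / 2
    return math.log(golden_ratio, base)


capacity_bits = golden_ratio_capacity()
print(round(capacity_bits, 4))  # roughly 0.6942 under the base-2 assumption
```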
|
|
| Spectrum sensing by cognitive radios at very low SNR. (arXiv:0907.1992v2 [cs.IT] UPDATED) |
[Jul. 15th, 2009|07:43 am] |
Spectrum sensing by cognitive radios at very low SNR. (arXiv:0907.1992v2 [cs.IT] UPDATED)
Spectrum sensing is one of the enabling functionalities for cognitive radio
(CR) systems to operate in the spectrum white space. To protect the primary
incumbent users from interference, the CR is required to detect incumbent
signals at very low signal-to-noise ratio (SNR). In this paper, we present a
spectrum sensing technique based on correlating spectra for detection of
television (TV) broadcasting signals. The basic strategy is to correlate the
periodogram of the received signal with the a priori known spectral features of
the primary signal. We show that according to the Neyman-Pearson criterion,
this spectral correlation-based sensing technique is asymptotically optimal at
very low SNR and with a large sensing time. From the system design perspective,
we analyze the effect of the spectral features on the spectrum sensing
performance. Through the optimization analysis, we obtain useful insights on
how to choose effective spectral features to achieve reliable sensing.
Simulation results show that the proposed sensing technique can reliably detect
analog and digital TV signals at SNR as low as -20 dB.
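
The basic strategy of correlating the periodogram with known spectral features can be sketched as below. This is a toy illustration under stated assumptions: the template is a hypothetical 0/1 mask over frequency bins, the received signal is a noiseless tone, and the threshold selection, noise model, and TV signal features studied in the paper are not reproduced.

```python
import cmath
import math


def periodogram(samples):
    """Squared magnitude of the DFT, normalized by the number of samples."""
    n = len(samples)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(samples))) ** 2 / n
        for k in range(n)
    ]


def spectral_correlation_statistic(samples, template):
    """Correlate the periodogram with an a priori known spectral template."""
    spectrum = periodogram(samples)
    return sum(p * s for p, s in zip(spectrum, template)) / len(spectrum)


n = 16
# Hypothetical received signal: a real tone sitting exactly on bin 3.
received = [math.cos(2 * math.pi * 3 * t / n) for t in range(n)]

# Templates mark the bins where the primary signal is expected to have energy
# (a real tone on bin k also appears on the mirror bin n - k).
matching_template = [1.0 if k in (3, n - 3) else 0.0 for k in range(n)]
other_template = [1.0 if k in (5, n - 5) else 0.0 for k in range(n)]

stat_match = spectral_correlation_statistic(received, matching_template)
stat_other = spectral_correlation_statistic(received, other_template)
print(stat_match > stat_other)
```

A detector would compare such a statistic against a threshold chosen for a target false-alarm rate; that step is left out of this sketch.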
|
|
| SMT-Based Bounded Model Checking for Embedded ANSI-C Software. (arXiv:0907.2072v2 [cs.SE] UPDATED) |
[Jul. 15th, 2009|07:43 am] |
SMT-Based Bounded Model Checking for Embedded ANSI-C Software. (arXiv:0907.2072v2 [cs.SE] UPDATED)
Propositional bounded model checking has been applied successfully to verify
embedded software but is limited by the increasing propositional formula size
and the loss of structure during the translation. These limitations can be
reduced by encoding word-level information in theories richer than
propositional logic and using SMT solvers for the generated verification
conditions. Here, we investigate the application of different SMT solvers to
the verification of embedded software written in ANSI-C. We have extended the
encodings from previous SMT-based bounded model checkers to provide more
accurate support for finite variables, bit-vector operations, arrays,
structures, unions and pointers. We have integrated the CVC3, Boolector, and Z3
solvers with the CBMC front-end and evaluated them using both standard software
model checking benchmarks and typical embedded applications from
telecommunications, control systems and medical devices. The experiments show
that our approach can analyze larger problems and substantially reduce the
verification time.
|
|
| An Augmented Lagrangian Approach for Sparse Principal Component Analysis. (arXiv:0907.2079v1 [math.O |
[Jul. 15th, 2009|07:43 am] |
An Augmented Lagrangian Approach for Sparse Principal Component Analysis. (arXiv:0907.2079v1 [math.OC] CROSS LISTED)
Principal component analysis (PCA) is a widely used technique for data
analysis and dimension reduction with numerous applications in science and
engineering. However, the standard PCA suffers from the fact that the principal
components (PCs) are usually linear combinations of all the original variables,
and it is thus often difficult to interpret the PCs. To alleviate this
drawback, various sparse PCA approaches have been proposed in the literature
[15, 6, 17, 28, 8, 25, 18, 7, 16]. Despite success in achieving sparsity, some important
properties enjoyed by the standard PCA are lost in these methods such as
uncorrelation of PCs and orthogonality of loading vectors. Also, the total
explained variance that they attempt to maximize can be too optimistic. In this
paper we propose a new formulation for sparse PCA, aiming at finding sparse and
nearly uncorrelated PCs with orthogonal loading vectors while explaining as
much of the total variance as possible. We also develop a novel augmented
Lagrangian method for solving a class of nonsmooth constrained optimization
problems, which is well suited for our formulation of sparse PCA. We show that
it converges to a feasible point, and moreover under some regularity
assumptions, it converges to a stationary point. Additionally, we propose two
nonmonotone gradient methods for solving the augmented Lagrangian subproblems,
and establish their global and local convergence. Finally, we compare our
sparse PCA approach with several existing methods on synthetic, random, and
real data, respectively. The computational results demonstrate that the sparse
PCs produced by our approach substantially outperform those by other methods in
terms of total explained variance, correlation of PCs, and orthogonality of
loading vectors.