ArXiv.org
arxiv_cs

Evaluating Methods to Rediscover Missing Web Pages from the Web Infrastructure. (arXiv:0907.2268v1 [cs.IR]) [Jul. 15th, 2009|07:44 am]

Missing web pages (pages that return the 404 "Page Not Found" error) are part of the browsing experience. The manual use of search engines to rediscover missing pages can be frustrating and unsuccessful. We compare four automated methods for rediscovering web pages. We extract the page's title, generate the page's lexical signature (LS), query the bookmarking website delicious.com for the page's tags and generate an LS from the page's link neighborhood. We use all methods to query Internet search engines and analyze their retrieval performance. Our results show that both LSs and titles perform fairly well, with over 60% of URLs returned top ranked from Yahoo. However, the combination of methods improves the retrieval performance. Considering the complexity of the LS generation, querying the title first and, in case of insufficient results, querying the LSs second is the preferable setup. This combination accounts for more than 75% of URLs returned top ranked.
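
As a rough illustration of the title-first, LS-second setup described above, the sketch below ranks a page's terms by TF-IDF and falls back to the resulting signature only when the title query returns too few results. It assumes an LS is the top-k TF-IDF terms of the page. The function names, the small in-memory corpus used to estimate document frequencies, and the `search` callable are hypothetical stand-ins, not the authors' implementation.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def lexical_signature(page_text, corpus, k=5):
    """Return the k terms of page_text with the highest TF-IDF score.

    corpus is a list of other documents used only to estimate document
    frequency (an assumption; a real system would need a much larger
    source of frequency estimates).
    """
    tf = Counter(tokenize(page_text))
    doc_tokens = [set(tokenize(doc)) for doc in corpus]
    n_docs = len(corpus) + 1  # the page itself counts as one document
    scores = {}
    for term, count in tf.items():
        df = 1 + sum(term in tokens for tokens in doc_tokens)
        scores[term] = count * math.log(n_docs / df)
    # Highest score first; ties broken alphabetically for a stable order.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return ranked[:k]


def rediscover(title, signature, search, min_results=1):
    """Query with the title first; fall back to the LS if too few results.

    search is a hypothetical callable mapping a query string to a list of
    result URLs.
    """
    results = search(title)
    if len(results) >= min_results:
        return results
    return search(" ".join(signature))
```

Passing the search function in as a parameter keeps the sketch independent of any particular search engine API.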


read more at cs updates on arXiv.org

An Efficient Algorithm for Factoring Polynomials over Algebraic Extension Field. (arXiv:0907.2300v1 [cs.SC]) [Jul. 15th, 2009|07:44 am]

An efficient algorithm is presented for factoring polynomials over an algebraic extension field. The extension field is defined by a polynomial ring modulo a maximal ideal. If the ideal is given by its Gröbner basis, no extra Gröbner basis computation is needed for factoring a polynomial over the extension field. We will only use linear algebra to get a polynomial over the base field by a generic linear map, and this polynomial will be factorized over the base field. From these factors, the factorization of the polynomial over the extension field can be obtained. The algorithm has been implemented and the experiments show that our algorithm is very efficient.



Protocols and Performance Limits for Half-Duplex Relay Networks. (arXiv:0907.2309v1 [cs.IT]) [Jul. 15th, 2009|07:44 am]

This paper concentrates on relaying as one possibility to improve data rates in next-generation mobile networks. More specifically, it introduces protocols and analyzes performance limits in the half-duplex relay channel where multiple relay nodes support a single communication pair. In this channel, nodes are subject to the orthogonality constraint, which prohibits simultaneous receiving and transmitting on the same time-frequency resource. Based upon this practical consideration, different protocols are discussed and evaluated using a Gaussian system model.



Hard Fault Analysis of Trivium. (arXiv:0907.2315v1 [cs.CR]) [Jul. 15th, 2009|07:44 am]

Fault analysis is a powerful attack on stream ciphers. Up to now, the major idea of fault analysis has been to simplify the cipher system by injecting some soft faults. We call this soft fault analysis. As a hardware-oriented stream cipher, Trivium is weak under soft fault analysis.

In this paper we consider another type of fault analysis of stream ciphers, which is to simplify the cipher system by injecting some hard faults. We call this hard fault analysis. We present the following results about such an attack on Trivium. In Case 1, with probability not smaller than 0.2396, the attacker can obtain 69 bits of the 80-bit key. In Case 2, with probability not smaller than 0.2291, the attacker can obtain all of the 80-bit key. In Case 3, with probability not smaller than 0.2291, the attacker can partially solve the key. In Case 4, with non-negligible probability, the attacker can obtain a simplified cipher with a smaller number of state bits and a slower non-linearization procedure. In Case 5, with non-negligible probability, the attacker can obtain another simplified cipher. Moreover, these 5 cases can be identified by observing the key-stream.



Separations of non-monotonic randomness notions. (arXiv:0907.2324v1 [cs.CC]) [Jul. 15th, 2009|07:44 am]

In the theory of algorithmic randomness, several notions of random sequence are defined via a game-theoretic approach, and the notions that have received the most attention are perhaps Martin-Löf randomness and computable randomness. The latter notion was introduced by Schnorr and is rather natural: an infinite binary sequence is computably random if no total computable strategy succeeds on it by betting on bits in order. However, computably random sequences can have properties that one may consider to be incompatible with being random; in particular, there are computably random sequences that are highly compressible. The concept of Martin-Löf randomness is much better behaved in this and other respects; on the other hand, its definition in terms of martingales is considerably less natural. Muchnik, elaborating on ideas of Kolmogorov and Loveland, refined Schnorr's model by also allowing non-monotonic strategies, i.e. strategies that do not bet on bits in order. The resulting "non-monotonic" notion of randomness, now called Kolmogorov-Loveland randomness, has been shown to be quite close to Martin-Löf randomness, but whether these two classes coincide remains a fundamental open question. As suggested by Miller and Nies, we study in this paper weak versions of Kolmogorov-Loveland randomness, where the betting strategies are non-adaptive (i.e., the positions of the bits to bet on should be decided before the game). We obtain a full classification of the different notions we consider.



Cooperation in Subset Team Games: Altruism and Selfishness. (arXiv:0907.2376v1 [cs.GT]) [Jul. 15th, 2009|07:44 am]

This paper extends the theory of subset team games, a generalization of cooperative game theory requiring a payoff function that is defined for all subsets of players. This subset utility is used to define both altruistic and selfish contributions of a player to the team. We investigate properties of these games, and analyze the implications of altruism and selfishness for general situations, for prisoner's dilemma, and for a specific game with a Cobb-Douglas utility.



Optimal Diversity-Multiplexing Tradeoff in Selective-Fading MIMO Channels. (arXiv:0907.2391v1 [cs.IT]) [Jul. 15th, 2009|07:44 am]

We establish the optimal diversity-multiplexing (DM) tradeoff of coherent time, frequency, and time-frequency selective-fading multiple-input multiple-output (MIMO) channels and provide a code design criterion for DM tradeoff optimality. Our results are based on the new concept of the "Jensen channel" associated to a given selective-fading MIMO channel. While the original problem seems analytically intractable due to the mutual information between channel input and output being a sum of correlated random variables, the Jensen channel is equivalent to the original channel in the sense of the DM tradeoff and lends itself nicely to analytical treatment. We formulate a systematic procedure for designing DM tradeoff optimal codes for general selective-fading MIMO channels by demonstrating that the design problem can be separated into two simpler and independent problems: the design of an inner code, or precoder, adapted to the channel statistics (i.e., the selectivity characteristics) and an outer code independent of the channel statistics. Our results are supported by appealing geometric intuition, first pointed out for the flat-fading case by Zheng and Tse, IEEE Trans. Inf. Theory, 2003.



Design of Pulse Shapes and Digital Filters Based on Gaussian Functions. (arXiv:0907.2412v1 [cs.IT]) [Jul. 15th, 2009|07:44 am]

Two new pulse shapes for communications are presented. The first pulse shape is ISI-free and identical with the interpolating function (or ISI-free kernel) of a reconstruction formula in shift-invariant spaces with Gaussian generator. Several closed form representations in time and frequency domain are given including one for an approximation that is particularly simple. The second pulse shape is the root of the former and obtained by spectral factorization. As a consequence, shifted versions of it form an orthonormal system in the Hilbert space of finite-energy signals. The latter pulse shape is described as the response of an infinite-order digital FIR filter on a Gaussian function as input signal. Several equivalent versions of the digital filter including their finite-order approximations are presented. All filters enjoy the property that explicit formulas for their coefficients and poles are available. The filters are fully parametrizable with respect to bandwidth and sampling rate of the digital data.



Computing Multidimensional Persistence. (arXiv:0907.2423v1 [cs.CG]) [Jul. 15th, 2009|07:44 am]

The theory of multidimensional persistence captures the topology of a multifiltration -- a multiparameter family of increasing spaces. Multifiltrations arise naturally in the topological analysis of scientific data. In this paper, we give a polynomial time algorithm for computing multidimensional persistence. We recast this computation as a problem within computational algebraic geometry and utilize algorithms from this area to solve it. While the resulting problem is EXPSPACE-complete and the standard algorithms take doubly-exponential time, we exploit the structure inherent within multifiltrations to yield practical algorithms. We implement all algorithms in the paper and provide statistical experiments to demonstrate their feasibility.



AWiMA: An architecture for Adhoc Wireless Mobile internet Access. (arXiv:0907.2252v1 [cs.NI]) [Jul. 15th, 2009|07:44 am]

This paper suggests a system architecture for wireless wide-area-networking access using adhoc networking between a mobile Client node without direct connectivity to a wireless-wide-area-network and a mobile Service Provider node with connectivity to a wireless-wide-area-network. It provides a means for securely providing such adhoc wireless networking services using a Server for tunneling and routing, registration and authentication. The architecture also provides support for handoff of a Client node from one Service Provider to another with persistence of a tunnel between the Client and the Server enabling a soft-handoff. Different wireless protocols may be used for adhoc networking, with filtered interconnection of authenticated Clients implemented at a Service Provider node. The architecture is applicable across different wide-area-network protocols, and provides simultaneous support for multiple wide-area-network protocols.



Sequential Posted Pricing and Multi-parameter Mechanism Design. (arXiv:0907.2435v1 [cs.GT]) [Jul. 15th, 2009|07:44 am]

We consider the classical mathematical economics problem of Bayesian optimal mechanism design, where a principal aims to optimize a given objective when allocating resources to self-interested agents. We show that for general product distributions on agent preferences and resource allocation problems that satisfy matroid properties (e.g., multi-unit auctions, matchings, spanning trees), sequential posted price mechanisms, where agents are approached in turn and offered a pre-computed take-it-or-leave-it offer, are at most a 4-approximation to the optimal single-round mechanism. Notably, the analysis of this sequential posted price mechanism can be extended to give approximation mechanisms for the unsolved multi-parameter setting. In stark contrast to the single-parameter setting, in multi-parameter settings there is no general description or tractable implementation of optimal mechanisms. For decades, this unanswered issue has been widely considered one of the most important in the economic theory on mechanism design. We focus on the unit-demand special case where each agent has a different value for each resource or service but desires at most one. Our second result is that for general product distributions on unit-demand preferences and matroid problems, an easy-to-compute VCG-type mechanism is an 8-approximation to the optimal (deterministic) mechanism. Our exposition focuses on the objective of profit maximization; however, the approaches generalize to other objectives that are linear in valuations and payments.
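
A minimal sketch of a sequential posted price mechanism for the k-unit auction, one of the matroid settings mentioned above. The prices are taken as given inputs: how the paper pre-computes them is not stated in the abstract, so `values`, `prices`, and `units` are illustrative assumptions rather than the authors' construction.

```python
def sequential_posted_pricing(values, prices, units):
    """Offer each agent, in order, a take-it-or-leave-it price.

    values[i] is agent i's private value, prices[i] is the pre-computed
    offer made to agent i, and units is the number of identical items
    for sale. Returns (winners, revenue).
    """
    winners = []
    revenue = 0.0
    for agent, (value, price) in enumerate(zip(values, prices)):
        if len(winners) == units:
            break  # no items left, so no further offers are feasible
        if value >= price:  # the agent accepts any offer at or below its value
            winners.append(agent)
            revenue += price
    return winners, revenue


winners, revenue = sequential_posted_pricing(
    values=[3, 10, 7, 8], prices=[5, 5, 5, 5], units=2
)
print(winners, revenue)  # [1, 2] 10.0
```

Each agent only has to compare one number with its own value, so the agent's decision does not depend on what other agents report.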



An Energy-Based Comparison of Long-Hop and Short-Hop Routing in MIMO Networks. (arXiv:0808.0037v4 [cs.IT] UPDATED) [Jul. 15th, 2009|07:44 am]

This paper considers the problem of selecting either routes that consist of long hops or routes that consist of short hops in a network of multiple-antenna nodes, where each transmitting node employs spatial multiplexing. This distance-dependent route selection problem is approached from the viewpoint of energy efficiency, where a route is selected with the objective of minimizing the transmission energy consumed while satisfying a target outage criterion at the final destination. Deterministic line networks and two-dimensional random networks are considered. It is shown that when 1) the number of hops traversed between the source and destination grows large or 2) when the target success probability approaches one or 3) when the number of transmit and/or receive antennas grows large, short-hop routing requires less energy than long-hop routing. It is also shown that if both routing strategies are subject to the same delay constraint, long-hop routing requires less energy than short-hop routing as the target success probability approaches one. In addition, numerical analysis indicates that given loose outage constraints, only a small number of transmit antennas are needed for short-hop routing to have its maximum advantage over long-hop routing, while given stringent outage constraints, the advantage of short-hop over long-hop routing always increases with additional transmit antennas.



Factorization of Joint Probability Mass Functions into Parity Check Interactions. (arXiv:0901.3056v2 [cs.IT] UPDATED) [Jul. 15th, 2009|07:43 am]

We show that any joint probability mass function (PMF) can be expressed as a product of parity check factors and factors of degree one with the help of some auxiliary variables, if the alphabet size is appropriate for defining a parity check equation. In other words, marginalization of a joint PMF is equivalent to a soft decoding task as long as a finite field can be constructed over the alphabet of the PMF. In factor graph terminology this claim means that a factor graph representing such a joint PMF always has an equivalent Tanner graph. We provide a systematic method based on the Hilbert space of PMFs and orthogonal projections for obtaining this factorization.



A Duality View of Boosting Algorithms. (arXiv:0901.3590v2 [cs.LG] UPDATED) [Jul. 15th, 2009|07:43 am]

We study boosting algorithms from a new perspective. We show that the Lagrange dual problems of AdaBoost, LogitBoost and soft-margin LPBoost with generalized hinge loss are all entropy maximization problems. By looking at the dual problems of these boosting algorithms, we show that the success of boosting algorithms can be understood in terms of maintaining a better margin distribution by maximizing margins and at the same time controlling the margin variance. We also theoretically prove that, approximately, AdaBoost maximizes the average margin, instead of the minimum margin. The duality formulation also enables us to develop column generation based optimization algorithms, which are totally corrective. We show that they exhibit almost identical classification results to those of standard stage-wise additive boosting algorithms but with much faster convergence rates. Therefore fewer weak classifiers are needed to build the ensemble using our proposed optimization technique.



Termination of the Sequence of SDS Sets and Machine Decision for Positive Semi-definite Forms. (arXiv:0904.4030v2 [cs.SC] UPDATED) [Jul. 15th, 2009|07:43 am]

Employing the concept of termination of sequence of SDS sets to describe the positive semi-definite property of a form, we establish a necessary and sufficient condition for deciding whether a given form on $\mathcal{R}^n_+$ is positive semi-definite or not, and show that, for a form which is (strictly) positive definite on $\mathcal{R}^n_+$, the corresponding sequence of SDS sets is positively terminating.

The above results are established as follows: We first define the column stochastic mean matrix, and then prove that if we choose countably infinitely many matrices at random (repeats allowed) from a finite set of $n\times n$ column stochastic mean matrices, then the product of these infinitely many matrices converges to a column stochastic mean matrix with rank 1. Finally, we prove the relations between termination of the sequence of SDS sets and the positive semi-definite property of a form.

The Maple program TSDS3, based upon these results, not only automatically proves polynomial inequalities, but also outputs counterexamples for those that are false. This method is verified to be very efficient and better than the Pólya method.



Bits Through Relay Cascades with Half-Duplex Constraint. (arXiv:0906.1599v2 [cs.IT] UPDATED) [Jul. 15th, 2009|07:43 am]

Consider a relay cascade, i.e. a network where the source node, the sink node and a certain number of intermediate relay nodes are arranged in a line. We assume that adjacent node pairs are connected by error-free (q+1)-ary pipes. The following communication scenario is treated. The source and a subset of the relays wish to communicate independent information to a common sink under the condition that each relay in the cascade is half-duplex constrained. We introduce a simple channel model for half-duplex constrained links and provide a coding scheme which transfers information by an information-dependent, non-deterministic allocation of the transmission and reception slots of the relays. The coding scheme requires synchronization on the symbol level through a shared clock. In the case of a relay cascade with a single source, the coding strategy is capacity achieving. Numerical values for the capacity of cascades of various lengths are provided, and it turns out that the capacities are significantly higher than the rates which are achievable with a deterministic time-sharing approach. If the cascade includes a source and a certain number of relays with their own information, the strategy achieves the cut-set bound when the rates of the relay sources fall below individual thresholds. Hence, a partial characterization of the boundary of the capacity region follows. For cascades composed of an infinite number of half-duplex constrained relays and a single source, we derive an explicit capacity expression. Remarkably, the capacity for q=1 is equal to the logarithm of the golden ratio. We finally show that the proposed coding strategy is superior to network coding in the case of the wireless, half-duplex constrained butterfly network.
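
For a sense of scale, the golden-ratio value quoted above for q=1 can be evaluated numerically. The abstract does not state the base of the logarithm; base 2 (so the result is in bits) is an assumption here, and the function name is purely illustrative.

```python
import math


def golden_ratio_log(base=2.0):
    """Return the logarithm of the golden ratio (1 + sqrt(5)) / 2."""
    golden_ratio = (1 + math.sqrt(5)) / 2
    return math.log(golden_ratio, base)


print(round(golden_ratio_log(), 4))  # 0.6942
```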



Spectrum sensing by cognitive radios at very low SNR. (arXiv:0907.1992v2 [cs.IT] UPDATED) [Jul. 15th, 2009|07:43 am]

Spectrum sensing is one of the enabling functionalities for cognitive radio (CR) systems to operate in the spectrum white space. To protect the primary incumbent users from interference, the CR is required to detect incumbent signals at very low signal-to-noise ratio (SNR). In this paper, we present a spectrum sensing technique based on correlating spectra for detection of television (TV) broadcasting signals. The basic strategy is to correlate the periodogram of the received signal with the a priori known spectral features of the primary signal. We show that according to the Neyman-Pearson criterion, this spectral correlation-based sensing technique is asymptotically optimal at very low SNR and with a large sensing time. From the system design perspective, we analyze the effect of the spectral features on the spectrum sensing performance. Through the optimization analysis, we obtain useful insights on how to choose effective spectral features to achieve reliable sensing. Simulation results show that the proposed sensing technique can reliably detect analog and digital TV signals at SNR as low as -20 dB.



SMT-Based Bounded Model Checking for Embedded ANSI-C Software. (arXiv:0907.2072v2 [cs.SE] UPDATED) [Jul. 15th, 2009|07:43 am]

Propositional bounded model checking has been applied successfully to verify embedded software but is limited by the increasing propositional formula size and the loss of structure during the translation. These limitations can be reduced by encoding word-level information in theories richer than propositional logic and using SMT solvers for the generated verification conditions. Here, we investigate the application of different SMT solvers to the verification of embedded software written in ANSI-C. We have extended the encodings from previous SMT-based bounded model checkers to provide more accurate support for finite variables, bit-vector operations, arrays, structures, unions and pointers. We have integrated the CVC3, Boolector, and Z3 solvers with the CBMC front-end and evaluated them using both standard software model checking benchmarks and typical embedded applications from telecommunications, control systems and medical devices. The experiments show that our approach can analyze larger problems and substantially reduce the verification time.



An asymptotically tight bound on the number of semi-algebraically connected components of realizable sign conditions. (arXiv:math/0603256v3 [math.CO] UPDATED) [Jul. 15th, 2009|07:43 am]

We prove an asymptotically tight bound (asymptotic with respect to the number of polynomials for fixed degrees and number of variables) on the number of semi-algebraically connected components of the realizations of all realizable sign conditions of a family of real polynomials. More precisely, we prove that the number of semi-algebraically connected components of the realizations of all realizable sign conditions of a family of $s$ polynomials in $\mathbb{R}[X_1,\ldots,X_k]$ whose degrees are at most $d$ is bounded by \[ \frac{(2d)^k}{k!}s^k + O(s^{k-1}). \] This improves the best upper bound known previously, which was \[ \frac{1}{2}\cdot\frac{(8d)^k}{k!}s^k + O(s^{k-1}). \] The new bound matches asymptotically the lower bound obtained for families of polynomials each of which is a product of generic polynomials of degree one.
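
Comparing the leading coefficients of the two bounds quoted above gives a feel for the size of the improvement. This is a short calculation from the stated formulas, not a statement taken from the paper: \[ \frac{\frac{1}{2}\cdot\frac{(8d)^k}{k!}}{\frac{(2d)^k}{k!}} = \frac{1}{2}\cdot 4^k = 2^{2k-1}. \] The leading term is therefore smaller by a factor of $2^{2k-1}$, independent of $d$; for $k=2$ that is a factor of 8.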



An Augmented Lagrangian Approach for Sparse Principal Component Analysis. (arXiv:0907.2079v1 [math.OC] CROSS LISTED) [Jul. 15th, 2009|07:43 am]

Principal component analysis (PCA) is a widely used technique for data analysis and dimension reduction with numerous applications in science and engineering. However, the standard PCA suffers from the fact that the principal components (PCs) are usually linear combinations of all the original variables, and it is thus often difficult to interpret the PCs. To alleviate this drawback, various sparse PCA approaches have been proposed in the literature [15, 6, 17, 28, 8, 25, 18, 7, 16]. Despite success in achieving sparsity, some important properties enjoyed by the standard PCA are lost in these methods, such as uncorrelation of PCs and orthogonality of loading vectors. Also, the total explained variance that they attempt to maximize can be too optimistic. In this paper we propose a new formulation for sparse PCA, aiming at finding sparse and nearly uncorrelated PCs with orthogonal loading vectors while explaining as much of the total variance as possible. We also develop a novel augmented Lagrangian method for solving a class of nonsmooth constrained optimization problems, which is well suited for our formulation of sparse PCA. We show that it converges to a feasible point, and moreover, under some regularity assumptions, it converges to a stationary point. Additionally, we propose two nonmonotone gradient methods for solving the augmented Lagrangian subproblems, and establish their global and local convergence. Finally, we compare our sparse PCA approach with several existing methods on synthetic, random, and real data, respectively. The computational results demonstrate that the sparse PCs produced by our approach substantially outperform those by other methods in terms of total explained variance, correlation of PCs, and orthogonality of loading vectors.


