| Betti numbers of semi-algebraic sets defined by partly quadratic systems of polynomials. (arXiv:0708 |
[Aug. 29th, 2007|07:23 pm] |
Betti numbers of semi-algebraic sets defined by partly quadratic systems of polynomials. (arXiv:0708.3522v1 [math.AG] CROSS LISTED)
Let $\R$ be a real closed field, $ {\mathcal Q} \subset
\R[Y_1,\ldots,Y_\ell,X_1,\ldots,X_k], $ with $ \deg_{Y}(Q) \leq 2, \deg_{X}(Q) \leq
d, Q \in {\mathcal Q}, \#({\mathcal Q})=m,$ and $ {\mathcal P} \subset
\R[X_1,\ldots,X_k] $ with $\deg_{X}(P) \leq d, P \in {\mathcal P}, \#({\mathcal
P})=s$, and $S \subset \R^{\ell+k}$ a semi-algebraic set defined by a Boolean
formula without negations, with atoms $P=0, P \geq 0, P \leq 0, P \in {\mathcal
P} \cup {\mathcal Q}$. We prove that the sum of the Betti numbers of $S$ is
bounded by $(\ell s m d)^{O(m+k)}$. This is a common generalization of
previously known bounds on the Betti numbers of closed semi-algebraic
sets defined by polynomials of degree $d$ and 2, respectively.
We also describe algorithms for computing the Euler-Poincar\'e
characteristic, as well as all the Betti numbers of such sets. The complexity
of the first algorithm is bounded by $(\ell s m d)^{O(m(m+k))}$, while that of
the second is bounded by $(\ell s m d)^{2^{O(m+k)}}$.
read more at cs updates on arXiv.org |
|
|
| Comparison of Tree-Child Phylogenetic Networks. (arXiv:0708.3499v1 [q-bio.PE] CROSS LISTED) |
[Aug. 29th, 2007|07:23 pm] |
Comparison of Tree-Child Phylogenetic Networks. (arXiv:0708.3499v1 [q-bio.PE] CROSS LISTED)
Phylogenetic networks are a generalization of phylogenetic trees that allow
for the representation of non-treelike evolutionary events, like recombination,
hybridization, or lateral gene transfer. In this paper, we present and study a
new class of phylogenetic networks, called tree-child phylogenetic networks,
where every non-extant species has some descendant through mutation. We provide
an injective representation of these networks as multisets of vectors of
natural numbers, their path multiplicity vectors, and we use this
representation to define a distance on this class and to give an alignment
method for pairs of these networks. To the best of our knowledge, they are
respectively the first true distance and the first alignment method defined on
a meaningful class of phylogenetic networks strictly extending the class of
phylogenetic trees. We provide simple, polynomial algorithms for reconstructing a
tree-child phylogenetic network from its path multiplicity vectors, for
computing the distance between two tree-child phylogenetic networks, and for
aligning a pair of tree-child phylogenetic networks. They have been implemented
as a Perl package and a Java applet, which are available at this http URL
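As a rough illustration of the path multiplicity representation described above, the sketch below computes, for every node of a small network, the vector counting the paths from that node to each leaf, and compares two networks through the resulting multisets. The dictionary encoding of the networks, the function names, and the use of the multiset symmetric difference as the distance are assumptions made for this example; they are not taken from the authors' Perl or Java code.

```python
from collections import Counter

def mu_representation(children, leaves):
    """Multiset of path multiplicity vectors of a rooted DAG.

    children: dict mapping each internal node to the list of its children
              (hypothetical encoding chosen for this sketch).
    leaves:   ordered list of leaf labels; entry i of a vector counts the
              paths from the node to leaves[i].
    """
    index = {leaf: i for i, leaf in enumerate(leaves)}
    memo = {}

    def mu(u):
        if u in memo:
            return memo[u]
        if u in index:
            # A leaf has exactly one (trivial) path, to itself.
            vec = tuple(1 if i == index[u] else 0 for i in range(len(leaves)))
        else:
            # Paths from u are the paths from its children, summed entrywise.
            vec = tuple(sum(col) for col in zip(*(mu(v) for v in children[u])))
        memo[u] = vec
        return vec

    nodes = set(children) | {v for vs in children.values() for v in vs}
    return Counter(mu(u) for u in nodes)

def mu_distance(net1, net2, leaves):
    """Size of the symmetric difference of the two multisets (assumed distance)."""
    m1 = mu_representation(net1, leaves)
    m2 = mu_representation(net2, leaves)
    return sum((m1 - m2).values()) + sum((m2 - m1).values())

leaves = ["a", "b", "c"]
# A tree ((a,b),c) and a network in which b descends from a hybrid node h.
tree = {"r": ["x", "c"], "x": ["a", "b"]}
network = {"r": ["x", "y"], "x": ["a", "h"], "y": ["h", "c"], "h": ["b"]}

print(mu_distance(tree, network, leaves))
```

In the second network the root reaches leaf b along two different paths, so its vector is (1, 2, 1); this is the kind of information that the tree's vectors cannot express.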
|
|
| Cost-minimising strategies for data labelling : optimal stopping and active learning. (arXiv:0708.12 |
[Aug. 29th, 2007|07:23 pm] |
Cost-minimising strategies for data labelling : optimal stopping and active learning. (arXiv:0708.1242v2 [cs.LG] UPDATED)
Supervised learning deals with the inference of a distribution over an output
or label space $\CY$ conditioned on points in an observation space $\CX$, given
a training dataset $D$ of pairs in $\CX \times \CY$. However, in a lot of
applications of interest, acquisition of large amounts of observations is easy,
while the process of generating labels is time-consuming or costly. One way to
deal with this problem is {\em active} learning, where points to be labelled
are selected with the aim of creating a model with better performance than that
of a model trained on an equal number of randomly sampled points. Furthermore,
given a fixed set of labelled examples, one may use {\em semi-supervised}
learning methods to discover regularities in the data using the unlabelled
examples. In contrast to these two approaches, this paper proposes to deal with
the labelling cost directly: The learning goal is defined as the minimisation
of a cost which is a function of the expected model performance and the total
cost of the labels used. This allows the development of general strategies and
specific algorithms for (a) optimal stopping, where the expected cost dictates
whether label acquisition should continue, (b) active learning, where the
sampling is guided by the expected cost, (c) empirical evaluation, where the
cost is used as a performance metric for a given combination of inference,
stopping and sampling methods. Though the main focus of the paper is optimal
stopping, we also aim to provide the background for further developments and
discussion in the related field of active learning.
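To make the cost-based stopping idea concrete, here is a minimal sketch that adds a per-label price to an estimated model error and stops as soon as one more label no longer lowers the total. The additive cost form, the myopic one-step rule, the function names, and the sample error estimates are all assumptions for illustration; the paper's own algorithms may differ.

```python
def total_cost(expected_error, n_labels, label_cost):
    """Combined cost: estimated model error plus the price of the labels used."""
    return expected_error + label_cost * n_labels

def myopic_stopping_point(expected_errors, label_cost):
    """Return the number of labels after which acquisition should stop.

    expected_errors[n] is a (hypothetical) estimate of the model error after
    n labels. The rule is myopic: stop at the first n where buying one more
    label would not reduce the total cost.
    """
    for n in range(len(expected_errors) - 1):
        now = total_cost(expected_errors[n], n, label_cost)
        nxt = total_cost(expected_errors[n + 1], n + 1, label_cost)
        if nxt >= now:
            return n
    return len(expected_errors) - 1

# Made-up learning curve with diminishing returns.
errors = [0.5, 0.3, 0.2, 0.15, 0.13, 0.12]
print(myopic_stopping_point(errors, label_cost=0.03))
```

With these numbers the fourth label improves the error by only 0.02, which is less than its price of 0.03, so the rule stops after three labels. In practice the future error is unknown and has to be estimated, which is where most of the difficulty lies.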
|
|
| Relative-Error CUR Matrix Decompositions. (arXiv:0708.3696v1 [cs.DS]) |
[Aug. 29th, 2007|07:23 pm] |
Relative-Error CUR Matrix Decompositions. (arXiv:0708.3696v1 [cs.DS])
Many data analysis applications deal with large matrices and involve
approximating the matrix using a small number of ``components.'' Typically,
these components are linear combinations of the rows and columns of the matrix,
and are thus difficult to interpret in terms of the original features of the
input data. In this paper, we propose and study matrix approximations that are
explicitly expressed in terms of a small number of columns and/or rows of the
data matrix, and thereby more amenable to interpretation in terms of the
original data. Our main algorithmic results are two randomized algorithms which
take as input an $m \times n$ matrix $A$ and a rank parameter $k$. In our first
algorithm, $C$ is chosen, and we let $A'=CC^+A$, where $C^+$ is the
Moore-Penrose generalized inverse of $C$. In our second algorithm $C$, $U$, $R$
are chosen, and we let $A'=CUR$. ($C$ and $R$ are matrices that consist of
actual columns and rows, respectively, of $A$, and $U$ is a generalized inverse
of their intersection.) For each algorithm, we show that with probability at
least $1-\delta$: $$ ||A-A'||_F \leq (1+\epsilon) ||A-A_k||_F, $$ where $A_k$
is the ``best'' rank-$k$ approximation provided by truncating the singular
value decomposition (SVD) of $A$. The number of columns of $C$ and rows of $R$
is a low-degree polynomial in $k$, $1/\epsilon$, and $\log(1/\delta)$. Our two
algorithms are the first polynomial time algorithms for such low-rank matrix
approximations that come with relative-error guarantees; previously, in some
cases, it was not even known whether such matrix decompositions exist. Both of
our algorithms are simple, they take time of the order needed to approximately
compute the top $k$ singular vectors of $A$, and they use a novel, intuitive
sampling method called ``subspace sampling.''
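The first construction, $A'=CC^+A$, can be sketched with NumPy as follows. The sampling probabilities used here (squared column norms of the top-$k$ right singular vectors, normalised to sum to one) are one common reading of "subspace sampling" and should be treated as an assumption, as should the function names, the number of sampled columns, and the pseudo-inverse cutoff; the paper's column counts and rescaling are not reproduced.

```python
import numpy as np

def subspace_sampling_probs(A, k):
    """Column probabilities from the top-k right singular vectors (assumed scheme)."""
    _, _, Vt = np.linalg.svd(A, full_matrices=False)
    scores = np.sum(Vt[:k, :] ** 2, axis=0)  # one score per column of A
    return scores / scores.sum()

def column_projection_approx(A, k, c, rng):
    """Sample c columns of A (with replacement) and return C C^+ A and the indices."""
    probs = subspace_sampling_probs(A, k)
    cols = rng.choice(A.shape[1], size=c, replace=True, p=probs)
    C = A[:, cols]
    # Explicit cutoff: repeated columns make C rank deficient, and tiny
    # singular values must not be inverted.
    C_pinv = np.linalg.pinv(C, rcond=1e-10)
    return C @ (C_pinv @ A), cols

rng = np.random.default_rng(0)
# A matrix of exact rank 3, so a handful of sampled columns spans its column space.
A = rng.standard_normal((40, 3)) @ rng.standard_normal((3, 30))
A_approx, cols = column_projection_approx(A, k=3, c=10, rng=rng)
rel_err = np.linalg.norm(A - A_approx) / np.linalg.norm(A)
print(rel_err)
```

Because $CC^+$ is an orthogonal projection onto the span of the sampled columns, the approximation is expressed entirely in terms of actual columns of $A$, which is the interpretability property the abstract emphasises.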
|
|
| Verified Real Number Calculations: A Library for Interval Arithmetic. (arXiv:0708.3721v1 [cs.MS]) |
[Aug. 29th, 2007|07:23 pm] |
Verified Real Number Calculations: A Library for Interval Arithmetic. (arXiv:0708.3721v1 [cs.MS])
Real number calculations on elementary functions are remarkably difficult to
handle in mechanical proofs. In this paper, we show how these calculations can
be performed within a theorem prover or proof assistant in a convenient and
highly automated as well as interactive way. First, we formally establish upper
and lower bounds for elementary functions. Then, based on these bounds, we
develop a rational interval arithmetic where real number calculations take
place in an algebraic setting. In order to reduce the dependency effect of
interval arithmetic, we integrate two techniques: interval splitting and Taylor
series expansions. This pragmatic approach has been developed, and formally
verified, in a theorem prover. The formal development also includes a set of
customizable strategies to automate proofs involving explicit calculations over
real numbers. Our ultimate goal is to provide guaranteed proofs of numerical
properties with minimal human theorem-prover interaction.
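The dependency effect and the benefit of interval splitting can be seen in a few lines of rational interval arithmetic. This is an unverified Python sketch using `fractions.Fraction`, not the formally verified library from the paper; the `Interval` class, the `split_eval` helper, and the test function $x(1-x)$ are assumptions chosen for illustration.

```python
from fractions import Fraction

class Interval:
    """Closed interval [lo, hi] with exact rational endpoints."""

    def __init__(self, lo, hi):
        self.lo, self.hi = Fraction(lo), Fraction(hi)
        if self.lo > self.hi:
            raise ValueError("lower bound exceeds upper bound")

    def __sub__(self, other):
        return Interval(self.lo - other.hi, self.hi - other.lo)

    def __mul__(self, other):
        products = [self.lo * other.lo, self.lo * other.hi,
                    self.hi * other.lo, self.hi * other.hi]
        return Interval(min(products), max(products))

    def width(self):
        return self.hi - self.lo

    def __repr__(self):
        return f"[{self.lo}, {self.hi}]"

def f(x):
    """x * (1 - x); x occurs twice, which triggers the dependency effect."""
    return x * (Interval(1, 1) - x)

def split_eval(func, x, n):
    """Evaluate func on n equal pieces of x and return the hull of the results."""
    step = x.width() / n
    parts = [func(Interval(x.lo + i * step, x.lo + (i + 1) * step))
             for i in range(n)]
    return Interval(min(p.lo for p in parts), max(p.hi for p in parts))

unit = Interval(0, 1)
print(f(unit))                 # naive enclosure over the whole interval
print(split_eval(f, unit, 4))  # tighter enclosure after splitting into 4 pieces
```

The true range of $x(1-x)$ on $[0,1]$ is $[0, 1/4]$. Evaluating on the whole interval gives $[0, 1]$ because the two occurrences of $x$ are treated as independent, while splitting into four pieces already tightens the enclosure to $[0, 3/8]$.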
|
|
| A Non Parametric Model for the Forecasting of the Venezuelan Oil Prices. (arXiv:0708.3829v1 [cs.CE]) |
[Aug. 29th, 2007|07:24 pm] |
A Non Parametric Model for the Forecasting of the Venezuelan Oil Prices. (arXiv:0708.3829v1 [cs.CE])
A neural net model for forecasting the prices of Venezuelan crude oil is
proposed. The inputs of the neural net are selected by reference to a dynamic
system model of oil prices by Mashayekhi (1995, 2001) and its performance is
evaluated using two criteria: the Excess Profitability test by Anatoliev and
Gerko (2005) and the characteristics of the equity curve generated by a trading
strategy based on the neural net predictions.
-----
We introduce here a non-parametric model for forecasting Venezuelan oil
prices whose inputs are selected on the basis of a dynamic system that
explains the prices in terms of those inputs. We describe the data collection
and pre-processing and the running of the network, and we evaluate its
forecasts through a statistical test of predictability and through the
characteristics of the equity curve induced by the trading strategy generated
by those forecasts.
|
|
| Design Strategies for the Geometric Synthesis of Orthoglide-type Mechanisms. (arXiv:0708.3809v1 [cs. |
[Aug. 29th, 2007|07:24 pm] |
Design Strategies for the Geometric Synthesis of Orthoglide-type Mechanisms. (arXiv:0708.3809v1 [cs.RO])
The paper addresses the geometric synthesis of Orthoglide-type mechanisms, a
family of 3-DOF parallel manipulators for rapid machining applications, which
combine advantages of both serial mechanisms and parallel kinematic
architectures. These manipulators possess quasi-isotropic kinematic
performances and are made up of three actuated fixed prismatic joints, which
are mutually orthogonal and connected to a mobile platform via three
parallelogram chains. The platform moves in the Cartesian space with fixed
orientation, similar to a conventional XYZ-machine. Three strategies have been
proposed to define the Orthoglide geometric parameters (manipulator link
lengths and actuated joint limits) as functions of a cubic workspace size and
dextrous properties expressed by bounds on the velocity transmission factors,
manipulability or the Jacobian condition number. Low inertia and intrinsic
stiffness have been set as additional design goals expressed by the minimal
link length requirement. For each design strategy, analytical expressions for
computing the Orthoglide parameters are proposed. It is shown that the
proposed strategies yield Pareto-optimal solutions, which differ by the
kinematic performances outside the prescribed Cartesian cube (but within the
workspace bounded by the actuated joint limits). The proposed technique is
illustrated with numerical examples for the Orthoglide prototype design.
|
|
| A Radio Resource Management strategy for downlink cooperation in distributed networks. (arXiv:0708.3 |
[Aug. 29th, 2007|07:24 pm] |
A Radio Resource Management strategy for downlink cooperation in distributed networks. (arXiv:0708.3764v1 [cs.IT])
Downlink cooperation between Base Stations is a simple, efficient alternative
to macrodiversity for providing QoS continuity during mobility in distributed
cellular networks. It brings cooperation diversity and only requires data
forwarding between Base Stations. This paper presents a strategy for downlink
cooperation triggering, resource allocation and power control in an OFDMA-based
system. Power control aims at maximizing the equivalent capacity that is
achieved thanks to the diversity brought by relaying, while taking into account
inter-cell interference. We use an iterative method which dedicates a
proportion of the total power to relayed users. Simulation results show that
our method increases capacity at any load. It significantly reduces the rejection
probability thanks to the increased SIR of users at the cell border. The paper also
shows that cooperation should be limited to cell-border users for restricting
additional resource consumption, and that it should make use of diversity as
much as possible.
|
|
| Searching for a dangerous host: randomized vs. deterministic. (arXiv:0708.3734v1 [cs.DC]) |
[Aug. 29th, 2007|07:24 pm] |
Searching for a dangerous host: randomized vs. deterministic. (arXiv:0708.3734v1 [cs.DC])
A Black Hole is a harmful host in a network that destroys incoming agents
without leaving any trace of such an event. The problem of locating the black hole
in a network through a team of agents coordinated by a common protocol is
usually referred to in the literature as the Black Hole Search problem (or BHS for
brevity) and it is a consolidated research topic in the area of distributed
algorithms. The aim of this paper is to extend the results for BHS by
considering more general (and hence harder) classes of dangerous hosts. In
particular, we introduce the rB-hole as a probabilistic generalization of the Black
Hole, in which the destruction of an incoming agent is a purely random event
happening with some fixed probability (like flipping a biased coin). The main
result we present is that if we tolerate an arbitrarily small error probability
in the result then the rB-hole Search problem, or RBS, is not harder than the
usual BHS. We establish this result in two different communication models,
specifically both in the presence and in the absence of whiteboards not located at the
homebase. The core of our methods is a general reduction tool for transforming
algorithms for the black hole into algorithms for the rB-hole.
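One way to see why a small error probability helps is a back-of-the-envelope calculation, which is not taken from the paper: if a host destroys each incoming agent independently with probability $p$, it lets $k$ consecutive visits through with probability $(1-p)^k$, so repeating a probe enough times makes a dangerous host unlikely to pass as safe. The function name and the independence assumption below are illustrative only.

```python
import math

def visits_for_confidence(p_destroy, eps):
    """Smallest k with (1 - p_destroy)**k <= eps.

    p_destroy: assumed per-visit destruction probability of the rB-hole.
    eps:       tolerated probability of mistaking the rB-hole for a safe host.
    """
    if not (0 < p_destroy < 1) or not (0 < eps < 1):
        raise ValueError("p_destroy and eps must lie strictly between 0 and 1")
    return math.ceil(math.log(eps) / math.log(1 - p_destroy))

print(visits_for_confidence(0.5, 0.01))
print(visits_for_confidence(0.1, 0.05))
```

The number of repetitions grows only logarithmically in $1/\epsilon$, which is consistent with the claim that tolerating an arbitrarily small error does not make the search fundamentally harder.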
|
|
| Parametric Stiffness Analysis of the Orthoglide. (arXiv:0708.3723v1 [cs.RO]) |
[Aug. 29th, 2007|07:24 pm] |
Parametric Stiffness Analysis of the Orthoglide. (arXiv:0708.3723v1 [cs.RO])
This paper presents a parametric stiffness analysis of the Orthoglide, a
3-DOF translational Parallel Kinematic Machine. First, a compliant modeling of
the Orthoglide is conducted based on an existing method. Then the stiffness matrix
is symbolically computed. This allows one to easily study the influence of the
geometric design parameters on the matrix elements. Critical links are
displayed. Cutting forces are then modeled so that static displacements of the
Orthoglide tool during slot milling are symbolically computed. The influence of the
geometric design parameters on the static displacements is checked as well.
Other machining operations can be modeled. This parametric stiffness analysis
can be applied to any parallel manipulator for which stiffness is a critical
issue.
|
|
|
|